Gene TM1040_2638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2638 
Symbol 
ID4077941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2771466 
End bp2772980 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content59% 
IMG OID638007962 
Producthypothetical protein 
Protein accessionYP_614632 
Protein GI99082478 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATTT CTTTTGTTCG CAAGCTGGGC GCCGTCGCGC CGGTTCTGCT GATTGCCCAG 
GCGGCACATG CTGACGTCAC GGCACAGGAC GTCTGGGCCG ATTGGAAGGA CTACATGTCC
AGCACCGGCT ACGAGATCAC CGCAACCGAA ACCGAAAGCA GCGGCAAGCT CACCGTTTCC
GACATCACCA TGTCGATGGA TGTCGAGGGC GACTCCTTCT CCATGACCAT GGGATCGCTC
GATTTCATCG AGAACGGCAA TGGCACCGTG AACGTCGTGA TGCCCACCAC TTTCCCGATG
TCCTTCAACG TCAACGCAGA TGGCGATGCG ATCTCTGGGG ATCTCCTCTA TACCCACGAC
GGCAGCCCGA TGGTCGTGAG CGGCGACACC TCGGAAATGG CGTATAACTA CACCGGTGCG
ACCTCCGCCA TCAGCCTCGC CAACCTGGTG ATCGACGGCG ATGCTGTGCC CGCAGACGCG
ATCATGCTCA ATGTCAACGT GACCGATATG GTCAGCAACA CGCTGATGAA GATCGGCGAT
GTGCGCTCTT ATTCGCAGAC CATGACCATG GCCTCGGTTG CCTATGACTT CATGTTCCAG
GAGCCCGAAG GGGACGATGG CGCAGCCTTC AACGGCGCGC TGCAGGGGCT TGAGTTCACC
GGTGACATGA CCATCCCCGA GGTCGACGAT CCCAGCGATT TGGCCGCGAT GCTCAAGGCG
GGCATGGCCT ATGTGGGTGG CTTCACCTTC GAATCCGGCA ACACCAATGT CAAAGGCAGC
GACGGTCCGG ATAATTTCGA CTTTCAGACC TCCTCAAATG GCGGCGCTAT CAACATCGGA
CTGAGCCCCG CCGGCTTGAA CTATGACGTG ACCCAGCGTG ACACGACGCT CAACATGATG
GGGTCCGACA TTCCGTTCCC GGTCTCGCTC AGCATGAAAG AAATGGGCAT GAACTTTGCC
ATGCCGCTGA CCAAGTCCGA CGAGGAGCAA GATTTTGCCC TCGGGATCGC GCTGCGTGAA
TTTGCGGTGC CGGACATGCT CTGGGGTCTG ATCGACCCGG CGGGCGAGCT GCCGCGCGAC
CCTGCAAACC TCGTTGTGGA TCTCTCCGGC AAGGGCAAGC TGTTCTTTGA CCTCGTGGAC
GAAGAGCAAA TGGCCGCCGT GGAATCCGGT GAAGAAATGC CGGGCGAGGT AAACTCCCTC
AGCATCAACG AGATCCTGCT CTCGGTTGCG GGTGCCGAAC TGACAGGTGA CGGCGCCTTC
ACCTTTGACA ACACCGACCT CGAGAGCTTT GACGGCATGC CCGCCCCCGA AGGCGAAGCC
AGCCTGAAGT TGGTGGGTGC AAACGCCCTG ATCGACAAGC TGATCGGCAT GGGCCTCGTC
TCCGAGGATG ACGCCATGGG CGCGCGCATG ATGATGGGCA TGTTCACGGT TCCAGCAGGC
GACGACACCG TGACCTCCAA GATCGAAGTC AACGAAGAAG GCCATGTGCT CGCCAACGGC
CAGCGCCTGA AGTAA
 
Protein sequence
MSISFVRKLG AVAPVLLIAQ AAHADVTAQD VWADWKDYMS STGYEITATE TESSGKLTVS 
DITMSMDVEG DSFSMTMGSL DFIENGNGTV NVVMPTTFPM SFNVNADGDA ISGDLLYTHD
GSPMVVSGDT SEMAYNYTGA TSAISLANLV IDGDAVPADA IMLNVNVTDM VSNTLMKIGD
VRSYSQTMTM ASVAYDFMFQ EPEGDDGAAF NGALQGLEFT GDMTIPEVDD PSDLAAMLKA
GMAYVGGFTF ESGNTNVKGS DGPDNFDFQT SSNGGAINIG LSPAGLNYDV TQRDTTLNMM
GSDIPFPVSL SMKEMGMNFA MPLTKSDEEQ DFALGIALRE FAVPDMLWGL IDPAGELPRD
PANLVVDLSG KGKLFFDLVD EEQMAAVESG EEMPGEVNSL SINEILLSVA GAELTGDGAF
TFDNTDLESF DGMPAPEGEA SLKLVGANAL IDKLIGMGLV SEDDAMGARM MMGMFTVPAG
DDTVTSKIEV NEEGHVLANG QRLK