Gene TM1040_3437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3437 
Symbol 
ID4075611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp461879 
End bp463357 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content62% 
IMG OID638004946 
Productextracellular solute-binding protein 
Protein accessionYP_611671 
Protein GI99078413 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.58601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.852102 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTCA AAAAACCCAT TCTGGCGGCC GTCTCTGCGC TCGCCCTTCT GGCGGGGCCT 
GCGCTGGCCA AGGACACCGT GACCTACGCC ACCCAGCTGG AGCCGCCGCA TCTCGATCCC
ACTGGCGGCG CGGCGCAAGC CATCGATACG GTGGTGTATC TCAATATCTT CGAGGGCCTC
ACGCGCTTTA CCCCGGATGG GGCTGTGGTG CCGCTTCTTG CGAAATCCTG GGAGATTTCC
GAGGACGGTC GGACTTATAC GTTCACTCTG CAAGAGGGCG TCACCTTTCA CGACGGCAGC
ACGCTTGATG CGCAGGATGT GAAATTCTCG CTCGATCGCG CCCGCGCCGA GGACAGCACC
AACGCCCAGA AGGCGCTCTT TGCGGACATT GCGGACGTCA CAGTGAGTGA TGCGCAAACC
GTGGTGGTGA CGCTCTCCGA GCCCAACGGC AATTTCCTCT TCAACCTCGC CTGGGGCGAT
GCGGTGATCG TGGCCGAAGA GTCGGTCGAG ACCCTGAAAA CTGCTCCTGT GGGCACCGGC
CCCTACCGCT TTGGCGAATG GGTGCAGGGA GACCGGGTAG AGATGGTGCG CAACCCGAAT
TACTGGGGCG AGATCCCCGA GCTGACGGGC GCCACGATCA AGTTCATCTC CGACCCCACC
GCCGCCTTTG CCGCGATGAT GGCCGAAGAC ATTGACGCCT TTGATAATTT CCCGGCGCCA
GAGAATATGA TCCAGTTCGA GGCCGATCCG CGTTTTCAGG TGATCGTTGG CTCCACCGAA
GGCGAGACGA TCCTGTCGAC GAACAACGCC CAGGCACCCT TTGACAACCC CAAAGTGCGT
CAGGCGCTGG CCCATGCGAT TGATCGTCAG GCCATCGTGG ATGGTGCGAT GTTTGGCTAT
GGCACGCCGA TTGGCACCCA TTTTGCGCCG CATAACCCGG CCTATGTGGA TCTCACCGGT
CAGTCCGATT TTGACCCCGA CAAAGCCCGC GCGCTTCTGG CCGAAGCGGG CCTTGCAGAT
GGGTTCACCA CCACGCTGCA CCTGCCGCCC CCCGCCTATG CCCGTCGCGG TGGCGAGATC
GTGGCCGCGC AGCTGGCCCA GGTGGGTATC ACCGCCGAGA TCATCAATGT GGAATGGGCG
CAGTGGCTTG AGACCGTGTT CAAAGGCAAG ACCTATGGTC TCACGATCGT CAGCCACACC
GAGCCGATGG ACATCGGGAT CTATGGCCGT CCGGATTATT ACTTCCAGTA TGACAATCCG
GAGTTCCAGG GCGTGATGAG CCGCCTCAAC GCGACCACCG ACCCGGACCA GCGTACGGCG
CTCCTGCAGG ACGCCCAGCG CATGATCGCG GATGACTATG TTAACGGCTA CCTGTTCCAG
CTGGCAAAGC TCGGCGTTGC CAAAGCCGGC CTTGAGGGCA TCTGGGCCAA TGCTCCGGCC
GCTGCCATCG AAATCGGGGC GCTCAGCTGG GCTGAGTAA
 
Protein sequence
MNFKKPILAA VSALALLAGP ALAKDTVTYA TQLEPPHLDP TGGAAQAIDT VVYLNIFEGL 
TRFTPDGAVV PLLAKSWEIS EDGRTYTFTL QEGVTFHDGS TLDAQDVKFS LDRARAEDST
NAQKALFADI ADVTVSDAQT VVVTLSEPNG NFLFNLAWGD AVIVAEESVE TLKTAPVGTG
PYRFGEWVQG DRVEMVRNPN YWGEIPELTG ATIKFISDPT AAFAAMMAED IDAFDNFPAP
ENMIQFEADP RFQVIVGSTE GETILSTNNA QAPFDNPKVR QALAHAIDRQ AIVDGAMFGY
GTPIGTHFAP HNPAYVDLTG QSDFDPDKAR ALLAEAGLAD GFTTTLHLPP PAYARRGGEI
VAAQLAQVGI TAEIINVEWA QWLETVFKGK TYGLTIVSHT EPMDIGIYGR PDYYFQYDNP
EFQGVMSRLN ATTDPDQRTA LLQDAQRMIA DDYVNGYLFQ LAKLGVAKAG LEGIWANAPA
AAIEIGALSW AE