Gene TM1040_3153 details

Gene Information

Locus tag: TM1040_3153
Symbol:
ID: 4075323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 133822
End bp: 135075
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 61%
IMG OID: 638004656
Product: extracellular solute-binding protein
Protein accession: YP_611389
Protein GI: 99078131
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000182033
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGTA AGTTTATGAT GGCCGCGCTG ACGGGCACTG CCCTGGTGGC CACTTCCGCA 
CTGGCCGAGG ATGTCACCCT CACTGTCGAA AGCTGGCGCA ATGACGACCT GACGCTCTGG
CAGGACAAGA TCATCCCCGC GTTCGAAGCC GCAAACCCCG GCATCAAGGT GAAATTCACC
CCCAGCGCGC CGACCGAATA CAACGCGGTC CTGAACTCCA AGCTGGACGC AGGCTCTGCT
GGTGATCTGA TCACCTGCCG CCCGTTTGAC GCCTCGCTTG CGCTCTATGA GGCGGGCCAC
CTCGCCGCGC TGGATGATAT GGACGCGATG AGCAACTTCT CTGACGTCGC CAAATCCGCA
TGGCAGACCG ACGATGGCTC CGCGAGCTTC TGTGTGCCGA TGGCCTCCGT GATCCACGGC
TTTATCTACA ACAAAGAGGC CTTCGAAGAG CTCGGCCTTG AGGTTCCGAC CACCGAAGAC
GAATTCTTTG CCGCGCTTGA GACCATCAAG GAAGACGGCA GCTATATCCC GATGGCGATG
GGCACCAACG ACCAGTGGGA AGCCGCCACC ATGGGCTATA ACAACATCGG CCCGAACTAC
TGGAAAGGCG AAGAAGGCCG TCGCGCCCTG ATCGCGGGCG AGCAGAAGCT CACCGACGAA
CAATGGGTTG CCCCCTATGC GACCCTCGCC AAATGGGCGG ATTATCTGGG CGACGGCTAT
GAGGCGCAGA CCTATCCTGA CAGCCAGAAC CTCTTCACGC TGGGCCGCGC GGCGATCTAT
CCGGCAGGCA GCTGGGAAAT TTCTGGCTTC AACGCGCAAG CCGATTTTGA AATGGGCGCC
TTCAAGGCTC CGGTCAAATC CGCAGGCGAC ACCTGCTATA TCTCGGACCA CACCGACATT
GGTATTGGCA TGAACGCCTC CACCGAGCAC CCCGAAGCCG CCAAGGCCTT CCTCGCCTGG
GTCGCATCGC CCGAGTTCGC GGACATCTTC GGCAACGCTC TGCCGGGCTT CTTCCCGCTC
TCCAATGCGC CGGTTGAGCT CGAAGATCCG CTGGCCAAGG AATTTGTAAG CTGGCGTGGC
GAGTGCGAGA GCACCATCCG CTCCACCTAC CAGATCCTGT CGCGCGGCAC GCCGAACCTC
GAAAACGAGA CCTGGGGCGC ATCCGTTGCC GCAATCAAAG GCACCGAAAC GCCCGAAGCT
CTGGGCGAAA AACTCCAGTC GGGTCTCGCA ACCTGGTACG AACCGCAACA GTAA
 
Protein sequence
MKSKFMMAAL TGTALVATSA LAEDVTLTVE SWRNDDLTLW QDKIIPAFEA ANPGIKVKFT 
PSAPTEYNAV LNSKLDAGSA GDLITCRPFD ASLALYEAGH LAALDDMDAM SNFSDVAKSA
WQTDDGSASF CVPMASVIHG FIYNKEAFEE LGLEVPTTED EFFAALETIK EDGSYIPMAM
GTNDQWEAAT MGYNNIGPNY WKGEEGRRAL IAGEQKLTDE QWVAPYATLA KWADYLGDGY
EAQTYPDSQN LFTLGRAAIY PAGSWEISGF NAQADFEMGA FKAPVKSAGD TCYISDHTDI
GIGMNASTEH PEAAKAFLAW VASPEFADIF GNALPGFFPL SNAPVELEDP LAKEFVSWRG
ECESTIRSTY QILSRGTPNL ENETWGASVA AIKGTETPEA LGEKLQSGLA TWYEPQQ
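
The coordinates and lengths in this record are consistent with each other: the span from 133822 to 135075 is 1254 bp, which is 418 codons, or 417 residues plus one stop codon. The following is a minimal sketch of that check, together with helpers for reading the space-grouped sequences shown above. The function names (clean_sequence, gc_fraction, expected_protein_length) are hypothetical and not part of any database tool, and the length formula assumes an unspliced CDS that ends in a stop codon.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon.

    Assumption: the CDS length is a whole number of codons.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
start_bp, end_bp = 133822, 135075
gene_length = end_bp - start_bp + 1           # coordinates are inclusive
print(gene_length)                            # 1254
print(expected_protein_length(gene_length))   # 417
```

Passing the full gene sequence through clean_sequence and then gc_fraction gives a value that can be compared against the GC content field.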