Gene TM1040_3635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3635 
Symbol 
ID4075063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp691965 
End bp693041 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content58% 
IMG OID638005155 
Productextracellular solute-binding protein 
Protein accessionYP_611864 
Protein GI99078606 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.248377 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGATA AGCTTGCGCA ATTCAGCCGT ATCGCGGTGG TCACAACCGC GATGGCAACA 
CTACCGATGA TGGCAGGGGC GGAAACGCTG CGCCTTCTGA CTTGGGGCGG CTACGCGCCC
GAAGACGTCA TTGCGAAATT CGAAGAAGAA ACCGGCCACA CAGTCGAAGT GACCACCTCG
AACAACGAAG AGATGATCGC AAAGTTGCGC GCCACCAATG GCGGCGGTTT TGATCTGGCC
CAGCCGAGCC AGGACCGCAT CACCAGCGCG CAGGAAGAGT TCGGCATCTA CAAGCCGATC
GATATGTCCC GCATCAATGC GGATCTGTTC ATTCCGTCGA TGCTGAGCGC GACCGCTGCA
AACACGACCT TTGAGGGTGA AGTCTACGGC GTACCGCATG TCTGGGGCAC CAGCGGTCTT
GTGGTGAATA CCGAGATGGC AGGCAATGTG CAGGACTACA GTGATCTTTG CGACGACTCG
GTTGCAGGCA AGGTTTCTTA TCGTTTGAAG CGCCCGACTC TGATTGGTTT CGCCTATTCC
ATGGGTCTGG ACCCGTTTGC GGCCTATGGC GATAGCGCTG CTTATCAGGG GATCCTCGAT
CAGGTCGAAG CGAAACTCAC CGAGTGTAAA GCCAACGTCA AAACCTATTG GGATGGTGGC
GACGAGATCA AAAACCTGCT GCGCTCCGGC GAAGTTGTGG CGTCCATGGC CTGGGATACC
GGTGGCTGGC AGCTCAACGC TGACAACCCC GATATCACCT TTGTTGCACC AAAGTCCGGT
GCGCTGGGTT GGATCGACAC CTTTGTTCTG CCTGCCCGTG GCCGTGCAGA TGATGCGGCC
TATGACTGGA TCAACTTTGT GATGCGCCCG GAAATCGCGG CGATGATCAC CAACACCGCC
GGGAACTTCA CTGCAGCGGT TGATGGTGAT GCAGCTGTCG ATGCGGACCT CAAAGCGCGC
TACCAGAGCA GCTTTGACCA GCAGGCGATC GACAACATCA AGTGGTATCC CCCGGTGCCC
GCAGGTCTCG AAGCGATGGA AGGGGCAAGC CTCGACCGGA TCAACGCGGC CAACTAA
 
Protein sequence
MTDKLAQFSR IAVVTTAMAT LPMMAGAETL RLLTWGGYAP EDVIAKFEEE TGHTVEVTTS 
NNEEMIAKLR ATNGGGFDLA QPSQDRITSA QEEFGIYKPI DMSRINADLF IPSMLSATAA
NTTFEGEVYG VPHVWGTSGL VVNTEMAGNV QDYSDLCDDS VAGKVSYRLK RPTLIGFAYS
MGLDPFAAYG DSAAYQGILD QVEAKLTECK ANVKTYWDGG DEIKNLLRSG EVVASMAWDT
GGWQLNADNP DITFVAPKSG ALGWIDTFVL PARGRADDAA YDWINFVMRP EIAAMITNTA
GNFTAAVDGD AAVDADLKAR YQSSFDQQAI DNIKWYPPVP AGLEAMEGAS LDRINAAN