Gene Sterm_3787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3787 
Symbol 
ID8599233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4025499 
End bp4026764 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content39% 
IMG OID 
ProductL-rhamnose isomerase 
Protein accessionYP_003310552 
Protein GI269122375 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAATA AGGAAAATAT TATAAAAAGC TATGAGCTTG CAAAAGAACA ATACGGAAAA 
ATAGGTGTGG ATACAGATGA AATATTAAAA AGACTGGATG AAATAATGAT TTCTGTTCAT
TGCTGGCAGG GAGACGATGT GCAGGGGTTT GAAAATCCGG AGGGAGCATT AACCGGCGGA
ATTCAGGCTA CAGGGAATTA TCCCGGAAAG GCAAGAAGCG CCGACGAACT GAGACAGGAT
CTGGAAATGG TATTTAAGCT TGTACCCGGA AAACATAAGG TAAATCTTCA TGCGATATAC
GGAGAGTTTG GAGGAGAAAA AACAGGAAGA GACGAGATAA AGCCCGAACA CTTTAGGAAT
TGGGTAGAAT GGGCAAAAAA AATGAATCTT GGTCTTGATT TTAACCCGAC ATTATTCTCA
CATCCCCTTG CAGAAGAAGC GACTTTATCA CATCCTGACA AAAAAATAAG GGATTATTGG
ATAAAGCACT GCAAAGCTTC GAGAAAAATA GGGGAGTATT TCGGAAAGGA GCTTGGTATT
CCGTGTGTAA CTAATATATG GATTCCCGAC GGATATAAGG ATATACCCGT AGACAGATAT
ATGCCGAGAA AACGTCTGAA AGAATCACTT GATGAAATTA TGAAAGAAAA AATCAACCCT
GAATATAATC TTGATGCAGT GGAATCAAAG GTATTCGGTC TGGGGCTGGA ATCTTACACT
GTGGGATCAC ATGAATTTTA TATGGGTTAT GCTGTGGAAA ATAAAACACT TTTGTGTCTT
GATGCAGGAC ATTTTCATCC TACGGAAGTA ATATCGGATA AGATACCGTC AGTGCTCTTA
TTTCTTGATC AGATACTGCT TCATGTAAGC AGACCGGTAA AATGGGACAG TGATCATGTT
GTTATTCTGG ATGACGAGCT GAGAGCAATA GCAAATCAGA TTATAAGATA TGATTTTGAT
AAAAGAGTGC ATATAGGGAT AGATTTCTTT GATGCAAGTA TAAACAGAAT AGCAGCATGG
GCAATAGGAG TAAGAAACAC AAGAAAGGCA TTAATGCTTG CTCTTTTGGA GCCAGTGGAA
AAATTAAAGG AAACAGAGCT TTCGGGAGAT TTCACAAGCA GACTTGCACT TCTTGAAGAA
TATAAAATGT ATCCTGCCGG AGCAGTATGG GATTATTACT GCAGTGTAAA GAATATCCCA
GCAGGGGAAG CATGGCTTGA GATTGTAAAG GAATATGAAA GAAATGAATT AAGCAAAAGA
AGCTGA
 
Protein sequence
MQNKENIIKS YELAKEQYGK IGVDTDEILK RLDEIMISVH CWQGDDVQGF ENPEGALTGG 
IQATGNYPGK ARSADELRQD LEMVFKLVPG KHKVNLHAIY GEFGGEKTGR DEIKPEHFRN
WVEWAKKMNL GLDFNPTLFS HPLAEEATLS HPDKKIRDYW IKHCKASRKI GEYFGKELGI
PCVTNIWIPD GYKDIPVDRY MPRKRLKESL DEIMKEKINP EYNLDAVESK VFGLGLESYT
VGSHEFYMGY AVENKTLLCL DAGHFHPTEV ISDKIPSVLL FLDQILLHVS RPVKWDSDHV
VILDDELRAI ANQIIRYDFD KRVHIGIDFF DASINRIAAW AIGVRNTRKA LMLALLEPVE
KLKETELSGD FTSRLALLEE YKMYPAGAVW DYYCSVKNIP AGEAWLEIVK EYERNELSKR
S