Gene Sterm_3720 details

Gene Information

Locus tag: Sterm_3720
Symbol:
ID: 8599166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 3949504
End bp: 3950751
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: PTS system Galactitol-specific IIC component
Protein accession: YP_003310485
Protein GI: 269122308
COG category:
COG ID:
TIGRFAM ID:
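
The start, end, gene length and protein length fields above are arithmetically related, and a short check can make that relationship explicit. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the final codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3949504, 3950751)
print(length, protein_length_aa(length))  # 1248 415
```

Under these assumptions the computed values agree with the Gene Length (1248 bp) and Protein Length (415 aa) fields.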


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.260527
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAATAT TACAATGGAT TTTGGGTCTG GGTGCTTCTG TAATGCTTCC TATTATCATG 
TTTGTTTTTG CCATGATAAT GGGGGCAGGC TTTACAAAAT CATTCAGATC AGGTATTACT
ATCGGAATTG GTTTTACGGG AATCAATCTT GTTATCAATC TTCTTACCTC TTCGCTGGGA
CCGGCTTCAA TAGCAATGAC TGAAAGACTG GGACTGAATC TTACCATTAT TGATATAGGA
TGGCCTGCAA TGTCCGCTAT ATCATGGGCA TGGGTTGCTG CCGGTCTTCT GATTCCGGTT
GTTCTTGTTA TTAACTTTAT AATGCTGTCC TTAAAATTAA CTAAAACAAT GAATGTGGAT
TTATGGAATT TCTGGCAGTT TTCATTTATA GGAGCAGCAG TCACTGCTAC AAGCGGAAGC
GTAATGTGGG GAATGGTGGC AGCCAGTGTT GCGGCAGTCA TCGCACTTCT TTTGGCTGAT
TATACACAAA AATATATTGA AAACTATTTC GGTATGCCGG GAATTTCTTT TCCGCATCTG
ACAGCTCTTG GATTCATGCC GCTGGTTATT CCGCTTAACT GGATTTTTGA CAGAATTCCC
GGAATAAACA AATTGTACGC AAGTCCTGAT ACGATCAGAA AAAAATTCGG AATATTCGGA
GAGCCGATGA TAATGGGAAT AATCATAGGA GCATTACTTG GAATTCTTGC AGGCTTTGGA
GTAAGTGAAG TTTTGAAGCT GGCAATTGCG ATGGCATCTG TTATGTTTAT TATGCCGAAA
ATGGTGGCAA TATTAATGGA AGGACTTATA CCTGTTTCTG AGGCAGCAAG AGAATTCATG
GCTAAGAAAT TCGCAGGGCG GGAAATTTTT ATCGGGCTGG ATGCGGCAGT ATCTCTCGGG
GAGCCGTCAG TTATAGCAGT GGGGCTATTA ATGGTGCCGA TTACTATTAT TCTTGCCTTT
ATTGTTCCGG GAAATCAGCT GCTTCCATTT GCTGATCTGG CTGTTATTCC GTTTATAGTA
TGTCTGATCA CGGCAATGTC AAAAGGAAAC GTAATAAGAT CTTTGATGAT ATCTACGATT
GTTATGGCAA TTGTACTCGT ATTTGCTACA AATCTTGCAC CTGCTGAAAC AATTATGGCA
AAAACAGCAG GAGTAACGCT GCCTGACGGA GCGACATTAA TAGGAAATCT GGACAGAGCA
AATTTAATAA CGTGGCTTCT TGTTAAATTT TTCTCGTTAT TCAAATAA
 
Protein sequence
MEILQWILGL GASVMLPIIM FVFAMIMGAG FTKSFRSGIT IGIGFTGINL VINLLTSSLG 
PASIAMTERL GLNLTIIDIG WPAMSAISWA WVAAGLLIPV VLVINFIMLS LKLTKTMNVD
LWNFWQFSFI GAAVTATSGS VMWGMVAASV AAVIALLLAD YTQKYIENYF GMPGISFPHL
TALGFMPLVI PLNWIFDRIP GINKLYASPD TIRKKFGIFG EPMIMGIIIG ALLGILAGFG
VSEVLKLAIA MASVMFIMPK MVAILMEGLI PVSEAAREFM AKKFAGREIF IGLDAAVSLG
EPSVIAVGLL MVPITIILAF IVPGNQLLPF ADLAVIPFIV CLITAMSKGN VIRSLMISTI
VMAIVLVFAT NLAPAETIMA KTAGVTLPDG ATLIGNLDRA NLITWLLVKF FSLFK
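
The gene and protein sequences above are linked by translation table 11, and the GC content field is a simple base count. The sketch below shows one way both could be recomputed from the gene sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table is built from the standard NCBI base ordering (table 11 uses the same amino-acid assignments as the standard code and differs in its permitted start codons), and only the first 60 bases of the gene are used as sample input.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest),
# paired with the amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence listed above.
first_60 = "ATGGAAATAT TACAATGGAT TTTGGGTCTG GGTGCTTCTG TAATGCTTCC TATTATCATG"
print(translate(first_60))  # MEILQWILGLGASVMLPIIM
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_content` would give a figure to compare with the GC content field.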