Gene Sterm_1201 details


Gene Information

Locus tag: Sterm_1201
Symbol:
ID: 8596680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 1303800
End bp: 1305119
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: PTS system, lactose/cellobiose family IIC subunit
Protein accession: YP_003308000
Protein GI: 269119823
COG category:
COG ID:
TIGRFAM ID:
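
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the function names are hypothetical and not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1303800, 1305119)
print(length, protein_length(length))  # prints: 1320 439
```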


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAA AACAAGGGAT TATGGATAAA TTAATAGTTT TTCTGGAAAA ATATCTTGAG 
CCTGTAGCTG CCAAAATCGA AAATCAAAGA CATATATCCA CTATAAAAAA CGGAATGATT
GCTCTTATGG CTGTTCTCAT GGTAGGATCA TTTTCACTGA TTATAATGGC TATAGGCGGA
ATGTTTCCGG ACGGCTCTGC CGTAAAAATG TTTTTTGAAA GATACAATAC TCTGATAAGC
CTTCCGTTCA GGTTTACATT CGGGCTTCTT TCAGTTTACT GCGCTGTCAG CATATCATAT
AATCATGCAA GACAGCTGAA AATTCCATTT TTACATGCTA TTATCGGAGG ACTTCTTACT
ACTCTTGTAT TAAATATAAA ACTTGTAGGT GATGAAGTAA ATATAGAATA TCTTGATTCA
AGAGGGCTGT TTATTGCTAT TTTTGCATCT TTGATTACAG TGGAAACTAT GGCATTTTTT
ATGAAAAATA AAATAACAAT CAGAATAAAA GGACTGCCCG ACGGAATTGC CCAGACTTTT
GAAGCAATTA TTCCTCTGGT TACTGTTTTA TTCGGCGCTG TTCTTGTAGA TGCACTGGTT
ATACATTTTA CAGGGGGAAG CAATCTGCCC GAAGCTTTTA CGACTTTCCT TGCACCGTCT
ATTAACAGTA TTGATACTCC TTATGCTATA TTCCTGATTT CATTTCTGGA AATGATATTC
TGGTTTATCG GATTGAATGG TTATGCCATT TTAATAGGAT TTGTTCTTCC TTTTATGACA
CAGTATCTTG GGGAAAATGC TGCGGCATAT GCAGCCGGAC TGCCTATCCC CCATGTTTTC
GCTCCTAATT TCTGGGATTA TTTTCTTGGT TTTTCCGGTT CGGGAATTAC CGGTGCCTTA
GTTATTCTAG CTCTGTTCAG TAAATCAAAG GAACTGAAAG CAATAGGAAA GGCCTCAGTG
GTACCTGCCA TATTTACAAT ATCAGAGCCT GTAGTATTCG GGCTTCCTGT AGTTTATAAC
CCTTATTTGT TTATACCGTT TGTATTTGGT ACACCGTGTA TCGGAGTTTT TGCATATTAT
GTATTTAAGC TGGGAATAGT TCGTCCCCCT ATTGCCAATG TAGGAGGAAC GCCTATACCG
CTGGCACAGT ATCTGGCTAC TATGGACTGG AAAGCGGTTG TGCTGGGATT TGTAATTTTG
GGACTGGCAG TCTGCATGTA TTATCCTTTT TTCAAAATGT ATGAAAGAAA AATTCTTCAG
GAAGAAAGTG TAGTCAGTGA CAGACAGGCA GCATTTGATG CTTTGGATTT AGATTTTTAA
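
The GC content field in the gene information can in principle be recomputed from a sequence laid out in 10-base blocks like the one above. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full gene sequence; the helper name `gc_content` is hypothetical, and the example input is only the first line of the sequence, not the whole gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used only as an example input.
first_line = "ATGAATAAAA AACAAGGGAT TATGGATAAA TTAATAGTTT TTCTGGAAAA ATATCTTGAG"
print(f"{gc_content(first_line):.1%}")
```

Passing the complete gene sequence instead of `first_line` would give the value for the whole gene.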
 
Protein sequence
MNKKQGIMDK LIVFLEKYLE PVAAKIENQR HISTIKNGMI ALMAVLMVGS FSLIIMAIGG 
MFPDGSAVKM FFERYNTLIS LPFRFTFGLL SVYCAVSISY NHARQLKIPF LHAIIGGLLT
TLVLNIKLVG DEVNIEYLDS RGLFIAIFAS LITVETMAFF MKNKITIRIK GLPDGIAQTF
EAIIPLVTVL FGAVLVDALV IHFTGGSNLP EAFTTFLAPS INSIDTPYAI FLISFLEMIF
WFIGLNGYAI LIGFVLPFMT QYLGENAAAY AAGLPIPHVF APNFWDYFLG FSGSGITGAL
VILALFSKSK ELKAIGKASV VPAIFTISEP VVFGLPVVYN PYLFIPFVFG TPCIGVFAYY
VFKLGIVRPP IANVGGTPIP LAQYLATMDW KAVVLGFVIL GLAVCMYYPF FKMYERKILQ
EESVVSDRQA AFDALDLDF
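
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming that table 11 assigns amino acids to codons exactly as the standard genetic code does (the two differ in which codons may act as start codons). The sketch does not handle alternative start codons or ambiguous bases, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATAAAA AACAAGGGAT TATGGATAAA"))  # prints: MNKKQGIMDK
```

The ten residues printed match the first block of the protein sequence listed above.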