Gene Sterm_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_0474 
Symbol 
ID8595962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp519047 
End bp520903 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content35% 
IMG OID 
ProductPTS system, beta-glucoside-specific IIABC subunit 
Protein accessionYP_003307282 
Protein GI269119105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000629343 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTATA AAAAAATCGG CAAGGAAATA TTTGATCTTG TTGGAGGTAA TAATAACATT 
ACTCAAATGA CACATTGTGC GACGCGGTTA AGATTTGTAC TAAAGGATTT TTCAAGTATT
GATCATGAAA AACTAAAAGC AATTCCCGGA GTTTTAGATG TTATATTTAA GGGAGGCCAG
TTACAAGTTA TCATTGGTCC TGATGTACCA GAAGTTTACC GGGCAGCCGA CGCTCTTTAT
ACTGGTTCAA AAAATAATAA TAATGAAAAG GGAGAAAATC AAAATATACT TAATCGTATT
ATGGCAGTTG TTGTAGGTAT TTTCAATCCA ATGTTACCTG CTATTACAGG TGCTGGTATG
ATTAAAGCTG TTTTAGCTCT TTTAAAGGCA TTTTCTCTAA TTGATGTTAA TGGTCAGACT
TTTGCTATAT TATCATTTAT TTCAGATTCA GCCTTTTATT TTATGCCGAT GATATTAGCA
TATTCATCAG CAAAAGTATT TAAATGCAGT CCCGGACTTG CTATTACACT AGCTGGTGTA
TTATTACATC CAAACTTTAT TGCAATGAAA AATGCTGGTG AAGCAGTCAA ATTCGCAGGA
ATCAATGTAC CGCTTGCTGG TTATGCCTCG ACAGTTATTC CTATTATCCT TATTGTTTTT
TTAATGTCTT ATGTTGAACG GTTTGCAGAA AAAATATTGC CGGTTCATAT TAAATATATA
GGCAGACCGC TTATTATTTT ACTTGTCATG GCACCACTTT CTTTAATTGT TGTTGGTCCG
CTTGGATTCA ACATTGGAAA CATTCTTGCC GCAGGTATAG CATTTTTAGA TAATAAAGCT
GGCTGGCTTG TTCCAACAGT TATTGGAACA TTCACACCTT TATTAGTTAT GTTCGGTCTC
CATAACGGAT TATTTCCAAT TGCTACAACT CAGCTTGGAG TTTCAGGGCA TGAATCAATT
ATGGGACCTG GTATGTTACC GTCTAATGTT GCTCAAGGTG CTGCTTGTAT GGCTGTTGCA
GTTAAAACAA AAAGCAAGGA AATGCGTCAG CAGGCTATCT CTGCTGGAAT AACTGCATTA
CTTGGAATTA CAGAGCCGGC AATGTATGGT GTAACTCTTC GATTGAAAAA ACCTTTAATT
GCTGTTATGA TAGGTGGCGG CCTTGGCGGA CTTTATGCAG GACTTACTGG TGTTGTTAGG
TATTCCTTTG GTTCTCCTGG ATTTGCAACT CTTCCGGTTT TTATTTCAGA TGATCCTGCA
AACATCAGAA ATGCTTTAAT TTCTGCTTTT ATTGGTATTA TTGTTTCATT TGTATTGACA
CTGCTTATAA AATTTGACTA TAATTATGGT TCACCAGAAG AAGTAATAAC GGATCAAGCA
CTATCTGCTG CCAGACCAAT TTTACAAACA GCTGTTATTA ACAGTCCGTT AAGCGGAGAA
GTTACCAGTC TAAGTGAAGT GAATGATGAA ATTTTTTCAA AAGGTCTGCT TGGAAAAGGT
GTTGCAGTTA TTCCTAATGA AGGTAAAGTC ACTGCGCCAT ATGATGCAGA AGTATCTATA
ATTGAAACAA AACATGCAGT AGCTTTTACC GGTGATAATG GTATTGATCT TCTAGTACAC
GTTGGTATTG ATACTGTAGA GTTAAATGGG AAATATTTTA ATTGTAAAGT TAAAAATGGG
GATAAAGTAA AAGCTGGTGA TGTTGTTTTA GAGTTTGATA TTAATGCTAT CAAAAAAGCA
GGTTACAAAA TAATTACCCC AATTATTATA ACTAATTCTA ATGAATTTGA AAGTATAACT
CAAATATCAT CAGAAAATAT TATTTCTGGC AAACCAATAT TAAATCTGGA AGTTTAG
 
Protein sequence
MDYKKIGKEI FDLVGGNNNI TQMTHCATRL RFVLKDFSSI DHEKLKAIPG VLDVIFKGGQ 
LQVIIGPDVP EVYRAADALY TGSKNNNNEK GENQNILNRI MAVVVGIFNP MLPAITGAGM
IKAVLALLKA FSLIDVNGQT FAILSFISDS AFYFMPMILA YSSAKVFKCS PGLAITLAGV
LLHPNFIAMK NAGEAVKFAG INVPLAGYAS TVIPIILIVF LMSYVERFAE KILPVHIKYI
GRPLIILLVM APLSLIVVGP LGFNIGNILA AGIAFLDNKA GWLVPTVIGT FTPLLVMFGL
HNGLFPIATT QLGVSGHESI MGPGMLPSNV AQGAACMAVA VKTKSKEMRQ QAISAGITAL
LGITEPAMYG VTLRLKKPLI AVMIGGGLGG LYAGLTGVVR YSFGSPGFAT LPVFISDDPA
NIRNALISAF IGIIVSFVLT LLIKFDYNYG SPEEVITDQA LSAARPILQT AVINSPLSGE
VTSLSEVNDE IFSKGLLGKG VAVIPNEGKV TAPYDAEVSI IETKHAVAFT GDNGIDLLVH
VGIDTVELNG KYFNCKVKNG DKVKAGDVVL EFDINAIKKA GYKIITPIII TNSNEFESIT
QISSENIISG KPILNLEV