Gene Sterm_3200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3200 
Symbol 
ID8598653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp3352142 
End bp3353467 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content35% 
IMG OID 
ProductPTS system, lactose/cellobiose family IIC subunit 
Protein accessionYP_003309972 
Protein GI269121795 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0308848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG ATAAAAAAAG AATATCTGAT TATTTCGGAA TGGTAGCTGT AAAACTGGGC 
GGACAGATTC ACCTGAGATC TCTCAGGGAC GGCTTTGCAT CTATTATGCC TTTTATGATT
CTGGCAGGGT TTGTAATATT TATTAATTAT GTTATTTTAG AACCGTCAGG ATTTATGGGG
AAAATAATAG ATCCAAAAAT TCTTACAAGG CTTCAGGAAA TAGGAAGCTC GGTTTCAAAC
GGAACACTCG GCATAATAAC AGTCATTGTT ACTGCGTCTG TTTCATATCA TCTAAGCCAG
AACAGAAATT TTGATAATGT ACTTGCATCA GTATTAGTAA GCTTGTCTAC ACTTTTTGTC
GTAACTCCTT TTATGAGTAC GTTTAAACCT GAAGGGCTGA ATGAAAGCTT TGTTGTAAAT
GGTGTTATTC CTGTGAATTA TACAAATGCT ACTGGTATGT TCGTAGGAAT TATAGTCGGT
CTTTTTGCTA CTGATATATT TATAAAGCTT TCGGCAAATA AAAAGCTTCA GATAAATATA
GCGGGAGATA TTCCGCCGGC AGTAATCAAA TCATTTAATG TACTGATCCC TATTATGATA
AATGTAATTA TCTTTGCAAT AATATCGTTT TTATTGAATC TTCTGTTTAA ATTAGATTTT
AATCAGCTGA TTTCCATGCT TATAACAAAG CCTCTAAGCC ATGTGACAAC AAGTCTTTTC
GGATTTTTAT TTTTAATGTG TCTGGGAAAT CTGTTTTTTG GATTCGGGAT TCATCAGGCT
GTTATATCCA ATCCTCTGCT GGATCCGTTT CTGCTCCAGA ATATGCAGGA AAATATGCTG
GCATATGCGA ATCATCAGCC CATACCTCAT ATAATTACTT CCGCATTTAA AGATGTTTTT
GGTATAACAG GAGGTTCGGG CAATACAATA GCTCTTCTTA TAGCGATTTT TATTTTTGGA
AGAAGAAAAG ATTATAAAGA TGTTGCTAAA ATGTCATTTA TGCCAGGTTT ATTCAATATA
AACGAACCGG TGATTTTTGG ACTGCCTATA GTTTTTAATC CGTTCCTTAT TGTTCCGTTT
GTAATAGCAC CGGTATTTTC TCTGCTTACT GCTTATTTTG CCACTTCAGT GGGACTTATA
AATCATGTAG TGGTACAGAT TCCATGGACA ACGCCTCCGG TTATTTCAGC GTTTCTTGCT
ACAGGAGGAG ACTGGCGTGC AGCGGTGCTG CAGTTAGTGA TTATTATAAT TACTATATTT
ATATATCTTC CTTTCCTGAA AATGGACGAA CGCATGTCAA AAATCAATAA GGATGATATT
TCCTGA
 
Protein sequence
MKKDKKRISD YFGMVAVKLG GQIHLRSLRD GFASIMPFMI LAGFVIFINY VILEPSGFMG 
KIIDPKILTR LQEIGSSVSN GTLGIITVIV TASVSYHLSQ NRNFDNVLAS VLVSLSTLFV
VTPFMSTFKP EGLNESFVVN GVIPVNYTNA TGMFVGIIVG LFATDIFIKL SANKKLQINI
AGDIPPAVIK SFNVLIPIMI NVIIFAIISF LLNLLFKLDF NQLISMLITK PLSHVTTSLF
GFLFLMCLGN LFFGFGIHQA VISNPLLDPF LLQNMQENML AYANHQPIPH IITSAFKDVF
GITGGSGNTI ALLIAIFIFG RRKDYKDVAK MSFMPGLFNI NEPVIFGLPI VFNPFLIVPF
VIAPVFSLLT AYFATSVGLI NHVVVQIPWT TPPVISAFLA TGGDWRAAVL QLVIIIITIF
IYLPFLKMDE RMSKINKDDI S