Gene Sterm_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_2139 
Symbol 
ID8597604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp2278450 
End bp2279805 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content39% 
IMG OID 
ProductPTS system, cellobiose-specific IIC subunit 
Protein accessionYP_003308924 
Protein GI269120747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00121046 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAT TTACTGCGTT TTTGGAAAAA CATCTTATGC CGATTGCAAC AAAATTAGCA 
ACAAACAAAT ACTTAACAGC CTTAAAAGAT TCATTTGTTT ATACTATGCC GTTTTTGATA
GTAGGGTCAG TAGTCCTTCT TTTGGTAAAT CTGCCAATAG GAGCACCCGA ACTTTCAGAA
GGTGTAAAGA ACCCTATGTA TGTGAAGTGG TATGGAGACT TTATGGCGCT GCATAAGGCA
TCTTTAGTTC AGCCGTTTTA TGTAAGTATG GGAATAATGT CTATATTTGT AGCTTTCGGA
ATAGGATACA GCCTATCACA GCAGTATCAG CTTAATGCCA TTACAGGAGG ATTTCTATCA
TTATTTACCT TCCTTATAAT GGGTGCTAAA TTTGACTGGT TGCCAATTGG TGAAGCAACA
GGAGGACCTG CATTATTTCA CATAGCAGAA GGCGGATGGA TGCCTGTGAT GGACGGACGG
TATCTGGATG CAAACGGATT ATTTACGGCA ATAATCGGAG GCTTTATAGC AGTGGAAATA
TACAGATTTA TGTTAAAAAA AGGATTTGTA ATTAAGCTTC CGGAGTCAGT TCCGCCGGCA
ATAGCAAGAT CATTTGAATT GTTAATGCCT ATAGTTGTGG TAATAATTAT ATTCCAGCCG
CTTAGTATCT TTGTACAAAG TAAGGCAAAT GTAATGATAC CTGAATTACT TATGGGAATT
GTAAGACCGA TAATAAAAGC TTCTGATACT CTGCCGGCAG TATTGTTTAT ACTATTAATA
GTACATTTAT TGTGGTTCTG CGGACTTCAC GGTGTAAACG TCGTGGTAGC AGTTATAAAT
CCGATTATTT TAAGCAATCT TGCGGAAAAT CAGGCGGCAT TGCAGGCCGG GCAGCAGATA
CCAAGAATAT TCGCAGGTGG TTTTCTTGAT GCATTCGTAT ATCTCGGCGG TTCTGGAGCA
ACAATAGGTC TGGCAATAGC AATGGCACTT TCAAAGAATG CCCATATGAA ATCAATAGGA
AGACTCTCAG TGGTTCCGGG AATCTTCAAT ATAAATGAAC CGGTAATTTT CGGTGCTCCG
ATAGTCATGA ATCCGGTATT GTTCATTCCG TTCCTGTTCG TACCTATGAT AAATGCAACA
ATAGCATGGA TATGTCTGAA AACAGGACTT GTAGGAAGAA TAGTAACACT GGTTCCATGG
ACTACTCCGT CACCAATAGC AGCATTGCTT GCTACGAACT TTAATGTAAT GGCTTTTGTA
TTAAGTGCAT TCCTTGTAGT ATTATCAACA ATATTATATC TGCCTTTCCT GAAAGCATAT
GCAGATATAC TTAATAAACA GGAAGCAGCT CAATAA
 
Protein sequence
MEKFTAFLEK HLMPIATKLA TNKYLTALKD SFVYTMPFLI VGSVVLLLVN LPIGAPELSE 
GVKNPMYVKW YGDFMALHKA SLVQPFYVSM GIMSIFVAFG IGYSLSQQYQ LNAITGGFLS
LFTFLIMGAK FDWLPIGEAT GGPALFHIAE GGWMPVMDGR YLDANGLFTA IIGGFIAVEI
YRFMLKKGFV IKLPESVPPA IARSFELLMP IVVVIIIFQP LSIFVQSKAN VMIPELLMGI
VRPIIKASDT LPAVLFILLI VHLLWFCGLH GVNVVVAVIN PIILSNLAEN QAALQAGQQI
PRIFAGGFLD AFVYLGGSGA TIGLAIAMAL SKNAHMKSIG RLSVVPGIFN INEPVIFGAP
IVMNPVLFIP FLFVPMINAT IAWICLKTGL VGRIVTLVPW TTPSPIAALL ATNFNVMAFV
LSAFLVVLST ILYLPFLKAY ADILNKQEAA Q