Gene Information |
Locus tag | Sterm_3121 |
Symbol | |
ID | 8598575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | - |
Start bp | 3269583 |
End bp | 3270872 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | PTS system, lactose/cellobiose family IIC subunit |
Protein accession | YP_003309894 |
Protein GI | 269121717 |
COG category | |
COG ID | |
TIGRFAM ID | |
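The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3269583
end_bp = 3270872

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1290 429
```

Both results agree with the Gene Length (1290 bp) and Protein Length (429 aa) fields.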
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0071332 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCTCTG GTTCTAAAAA AGCTTTTTCA GAAATATTAC TGGTATTTGC TGAAAAAATT GGAAAAAATA TTTATTTGTT AAGTCTGAGA GATGCATTTA TGCTTTCGTT CCCTCTTACA ATGTTCGGTT CTATACTACT GGTAATCAGC AATTTTCCAT TGATCTCCGC TGAAAAGAAG GATGTTCTAT GGAAACTGTT CGGACATTCC GTTGAGAGTT CCATGTTGTT AATGTCTATT TTTATAAGTC TGGGAATAGG ATATTATCTG TATAAATACA AAAATCCTGA AAAACCATCC GAGGCTTTAT ATTCAGCAGC AGTGGCGCTT ACTTCATTTT TTATAGTTAC ACCGTTTTCA CTGACACTTG AAGGGGAGAA TATTATTTCA GGTGTTATTC CTACTTCGCT GGTAGGTGCA CAGGGACTGT TTGTGGCTAT AATTATTTCA ATTCTGTCTA CTACTATTTA TAGTTTTATG ATTAACAAAA ATATAATTAT AAAAATGCCC AAAGAAGTTC CGCCGGCGAT TGCAAAATCT TTCTCAGCCA TTATTCCGGG TGCAATTACG CTTACTACAT TTATGATAAT AAGCCTGATC TTTGAAAGAA CTTCATTTCA TTCAATACAT AGTTTCGTAT ATGAATTTTT GCAAAAACCG TTAATAGGTT TGGGAACTTC ATTTGCAGCA ACTATGATTG CGGTATTTTT AGTACAGTTC TTCTGGTTTT TTGGGATACA CGGACATCTT GTAGTAAATC CGATCATGGA TACAGTGTGG AATGTGGCAT CTCTGGAAAA TTTAAATGCT TACAATAATA ATCTTCCGCT GCCGCACATT GTAACTAAAC AGTTCGTGGA AATGTTTACT GTGAGTGTAG GATCGATGGG GGCACTGTCA GCACTGACTG CTATATTTCT GGTAAGTAAG ATAAAACAGC AGCGTGAAGT AGCAAAGCTG GGATTTATCC CCGGTATTTT CAATATTGCA GAGCCGACTT TATTCGGACT TCCGGTAATA TTGAATCCGA TTATGGCTAT TCCGTGGATG TTAGGCGGAC CGCTTACTGC GGCAATTGCA TATTTTGCCA CAGTAACAGG AATAATGCCG AAAACAACAG GAGTTGCCGT ACCGTGGACA ATGCCGCTGG GAATAAGCGG AACTCTAGCT ACCAATTCAA TTATGGGCGG TGTAGTACAG GTGATATGTT TTATTGCAGT TGTACTGCTG TGGATTCCGT TTATTCTTTA TTCACAGAAA GAATTTAAGA CAAAAGAACA AAATACATAA
Protein sequence | MSSGSKKAFS EILLVFAEKI GKNIYLLSLR DAFMLSFPLT MFGSILLVIS NFPLISAEKK DVLWKLFGHS VESSMLLMSI FISLGIGYYL YKYKNPEKPS EALYSAAVAL TSFFIVTPFS LTLEGENIIS GVIPTSLVGA QGLFVAIIIS ILSTTIYSFM INKNIIIKMP KEVPPAIAKS FSAIIPGAIT LTTFMIISLI FERTSFHSIH SFVYEFLQKP LIGLGTSFAA TMIAVFLVQF FWFFGIHGHL VVNPIMDTVW NVASLENLNA YNNNLPLPHI VTKQFVEMFT VSVGSMGALS ALTAIFLVSK IKQQREVAKL GFIPGIFNIA EPTLFGLPVI LNPIMAIPWM LGGPLTAAIA YFATVTGIMP KTTGVAVPWT MPLGISGTLA TNSIMGGVVQ VICFIAVVLL WIPFILYSQK EFKTKEQNT
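The relationship between the two sequences can be spot-checked by translating a few codons. The sketch below assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. The `translate` helper and the constant names are illustrative, not part of the record.

```python
# Codon table built in the conventional T, C, A, G base order.
# Translation table 11 assigns the same amino acids as the standard code;
# '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases and last 9 bases of the gene sequence above.
first_30 = "ATGAGCTCTG GTTCTAAAAA AGCTTTTTCA"
last_9 = "AATACATAA"

print(translate(first_30))  # MSSGSKKAFS
print(translate(last_9))    # NT*
```

The first ten residues match the start of the protein sequence, and the final codons give N, T and a stop, matching its last two residues.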