Gene Sterm_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_4044 
Symbol 
ID8599488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4303580 
End bp4304890 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content26% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003310807 
Protein GI269122630 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA GAGTTCTAAT TTGGTTATTT ATATTAACTA ACATAGGAAT AAGTGAAGAA 
ATAATAAACT CTAAAAATGA AAAAGAAAAT AATAAAATAG TAGTTGAAGA GGCTGTAATC
AAACAAAATG AAAATGAAGA TAAAAAATAT TATGAAATGA TTGATGGAAA AATTTACTAT
AAAGAATACT GGGATACCCC TGTATTAGTT AAAGAAATTG ATGTAAAAAC TTTCGCTGAG
TTAGAATATT CATATGCAAA AGATAAAAAT AATTATTATT ATAAAAATAA AAAAATACTT
GTGGATAAGA ATAGTTTTGT TATAGAAAAT TATTTTATTG CTAAGGATAA AAATAATGTT
TATGTGCTGG GCAGAAAGAT ACCGGGGTTT GCATCAGAAA AATTGAAGAT TTATGAGGGG
GATACCAGAT ATATAACAGA CGGTACAGAT GTATATTTTA TAGACACGAA GTTGATGAAT
TCCGACCCGA GTACTTTTGT AATTTTAGAT AATGAAACAG CAAAAGATAA AAATAATGTG
TATAAGTATG GAGAGATTTT ATACGGTGCA GATTCCGAAA CTTTTGAAAT ATTAGGAAAT
ATATACTCAA AAGATAAAAA TAAAGTATAC TCGATATCAT ATCCTATGGA TAAAGCTGAT
GCTAAAAGCT TCAAAAGTAT AGGAGATTGG TATGGAAAGG ATAAAAATTT TGTATTTTAC
AGAGATGATA TAGTGGAGAA TGCAGACTCC AAAACTTTCA AACATTTGGA ATATAAATAC
GGGATAGATA AAAATTATGT CTATTATTCA AATAAAAGAA TAGAAGATGC AGATCCTCAA
AGTTTTGTAT TATTGAATAA ATATGTAAGT AAAGACAAAA ATTATGTCTA TTATCTCACA
TCAAAAGTAC TAAATTTTAA ACCTGAAGAT CTAAAAGACA GAAATATAGA TACTGATAAA
TTTGTTATCC AAGAAAAAGA ATCTGAATAT AATATGGCAA TTTTAATTGA TGAATATGAA
AAGAGAGAGA AAAAAAGACT AGAAGATTTA AGTGATAAGG GAATTACAGA AATGGGTTTT
GATTATTATA TGTATAAAAA CTTGATTTAT TATAATGATA AATTTAGCAG GAAAATCTTT
TTATACAAAG CTGATATAGA AACTTTAGAA GCAGTAGAAG ATAGTTATAA TGAAATTTTG
AAGGATAAAA ACAGAGTATA CATAGCCGGG AGAATGTTAG AAGGGGCAGA TCCGGTAAGT
TTTGAAGTAA TAACTGGAAG ATATTATAAA GATAAAAAAA TGTTTACATA G
 
Protein sequence
MKKRVLIWLF ILTNIGISEE IINSKNEKEN NKIVVEEAVI KQNENEDKKY YEMIDGKIYY 
KEYWDTPVLV KEIDVKTFAE LEYSYAKDKN NYYYKNKKIL VDKNSFVIEN YFIAKDKNNV
YVLGRKIPGF ASEKLKIYEG DTRYITDGTD VYFIDTKLMN SDPSTFVILD NETAKDKNNV
YKYGEILYGA DSETFEILGN IYSKDKNKVY SISYPMDKAD AKSFKSIGDW YGKDKNFVFY
RDDIVENADS KTFKHLEYKY GIDKNYVYYS NKRIEDADPQ SFVLLNKYVS KDKNYVYYLT
SKVLNFKPED LKDRNIDTDK FVIQEKESEY NMAILIDEYE KREKKRLEDL SDKGITEMGF
DYYMYKNLIY YNDKFSRKIF LYKADIETLE AVEDSYNEIL KDKNRVYIAG RMLEGADPVS
FEVITGRYYK DKKMFT