Gene Sterm_4129 details

Gene Information

Locus tag: Sterm_4129
Symbol:
ID: 8599573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 4410129
End bp: 4411505
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: peptidase M18 aminopeptidase I
Protein accession: YP_003310892
Protein GI: 269122715
COG category:
COG ID:
TIGRFAM ID:
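
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. Below is a minimal sketch in Python. It assumes the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends with a single stop codon. The function and constant names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 4410129
END_BP = 4411505
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1377
print(expected_protein_length(GENE_LENGTH_BP))  # 458
```

Both results match the reported gene length (1377 bp) and protein length (458 aa).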


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA AGAATTTATG GACAGGCTAT ACAGACAAAG AAAAAAAAGA GATAGAGCAG 
CTGGGTTCAG AGTATAAAGA ATTCCTGAAT AACGGGAAAA CAGAAAGAGA AGTAGTAGAA
ATAACTATGG AGATTCTGGA AAAAAACGGA TTTAAAAATA TAGAAAAAGT AAAGTCGCTG
AAAGCAGGGG ATAAGATATT TTTTAATAAC AGAAATAAAA ATATACTTAT GAGCATAATC
GGGAAAGAAG ATATGAAAAA AGGGATAAAC ATGGTAGTTT CACATGTGGA TTCACCCAGA
CTGGATCTCA AAATGAACCC TCTTTTGGAG GACGAAGAGT TTTTGATGCT GAATACTCAT
TATTACGGCG GAATAAAAAA ATATCAATGG GTAGCCACAC CGCTGGCAGT TCACGGAGTA
GTATTTTTGA AAAACGGTAA AAAAGTAAGC TTCGCAATAG GAGAAAGGGA AGAGGATCCG
GTTTTCTGCA TACCTGATAT ACTTCCGCAT TTATCAAGAA ATGTTCAGGA TGACAGAAAA
ACAAGAGAAG TAATAAAAGG TGAAGAGCTG AAGGTATTAT TTGGTTCCAT ACCTGTAAAA
GACAAAGATG CAAAAGAAAA GATAAAAGCC AATATACTGG AACATCTGAA AAAAGATTAT
GGAATAGAAG AAGAAGATTT TTTCTCTGCG GAGATAGAAA TAGTTCCGGC ATTAAAGGCA
AGGGATATAG GTCTGGACAG AGGAATGATA GGTGCCTATG GTCAGGATGA CAGAATCTGT
GCCTACACAT CGTTAAAAGC TCTTTTGGAT ATAAAAAAGC CTGAAAAAAC AGTACTATGC
TATTTTGCAG ATAAAGAAGA AGTGGGCAGT GACGGCAGTA CAGGGCTGAA TTCCAGTTTA
ATAGAATACT TTACGGGAAA GCTTCTGAAA TTAACTGGAA AAGATTATGA TGATCAGCTG
CTTAGAGAAA CTCTCTGGAA TTCCAAAGCT ATATCTGCCG ATGTTACTGC AGGAGTAGAC
CCTATTTTCA AATCTGTGCA TGATATGAAT AATTCGGCAA AGCTGTCACA TGGGATTCCT
GTTGCCAAAT ATACAGGACA TGGAGGAAAA AATGGCTCAA ATGATGCTGA TGCAGAATAT
ATGTATGAAA TAAGAGAAAT ATTCGATAAA AATAAGGTAG CCTATCAGGT AGGCGGATTT
AGCAAGGTAG ACGAAGGCGG GGGCGGAACT GTAGCAAAAT TTCTGGCATA TTTCGGAATA
AGAACTGTTG ATATAGGACC AGCATTGCTG TCGATGCACT CTTTGTTTGA GGTTTCTTCC
AAAGCGGATA TTTATGAGGC TTATAAAGCT TATAAAGCTT TTTATTCTAT AAAATAA
 
Protein sequence
MNKKNLWTGY TDKEKKEIEQ LGSEYKEFLN NGKTEREVVE ITMEILEKNG FKNIEKVKSL 
KAGDKIFFNN RNKNILMSII GKEDMKKGIN MVVSHVDSPR LDLKMNPLLE DEEFLMLNTH
YYGGIKKYQW VATPLAVHGV VFLKNGKKVS FAIGEREEDP VFCIPDILPH LSRNVQDDRK
TREVIKGEEL KVLFGSIPVK DKDAKEKIKA NILEHLKKDY GIEEEDFFSA EIEIVPALKA
RDIGLDRGMI GAYGQDDRIC AYTSLKALLD IKKPEKTVLC YFADKEEVGS DGSTGLNSSL
IEYFTGKLLK LTGKDYDDQL LRETLWNSKA ISADVTAGVD PIFKSVHDMN NSAKLSHGIP
VAKYTGHGGK NGSNDADAEY MYEIREIFDK NKVAYQVGGF SKVDEGGGGT VAKFLAYFGI
RTVDIGPALL SMHSLFEVSS KADIYEAYKA YKAFYSIK
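
The protein sequence is the translation of the gene sequence under translation table 11, and the reported GC content is derived from the same nucleotides. Below is a minimal sketch of both calculations in Python. It uses the standard codon assignments, which table 11 shares with table 1 (the two differ only in their permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are illustrative, and only the first 30 bases of the gene sequence are used in the example.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_blocks = "ATGAACAAAA AGAATTTATG GACAGGCTAT"
print(translate(first_blocks))  # MNKKNLWTGY
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.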