Gene Sterm_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_4040 
Symbol 
ID8599484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4298611 
End bp4299915 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content36% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003310803 
Protein GI269122626 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AATTATTTTT ATTCATGGTA TTAATGTTAG GACTGATAAT CAGCTGTGGT 
GAAAAGAAAG ATACAGCAGG AGACGGCGGC GGAGAAAAAG AAGTTGTACT GAGATTTTCA
TGGTGGGGCG GAGATTCGAG ACATAAAGCA ACACTTGATG CAATAAAATT ATTCGAGGAA
AAAAATCCCG GAATAAAGAT AAAAGCTGAG TATTCGGGAT GGGACGGGCA TTTTGAGAAA
GTATCAACAC AGGTTACGGG AAATACTGCA CCTGATATTA TGCAGATAGA CTGGAACTGG
CTTTATATTT TTTCAAAAAA CGGTGATGGT TTTTATAATT TCAATGATTT GAAAGAAGAC
TTTGATCTTT CAAATTATGA TGAAAATGTT TTGAGCTACA CTACAATTAA CGGAAAAGTA
CCTGCTATTC CTGTAGGAAT GAACGGAAGG GTATTTTATT ACAATAAAGC ACTATATGAA
AAAGCCGGGC TGTCGGTACC TGCAACAGCA GATGAATTGA TTTCTTCTAC AAAAATCTTA
AAAGAAAAAT TCGGCAATGA TACATATGCT CTGGATATTT CAACAACTGA CAGCGGAGTA
TTGTTCTTTT TGAAATATTA TGTAGAGCAG AAATTCGGAA AATCCCTGAT AGACAGTGAT
AATAAAATGG GAATTACAAA AGAGGAATTA ACAGAAGCAA TACAGTTTTA TAAAAAGCTC
GTGGATGAAG GTGTAGTACT ATCAAGTAAA GATAACGCAG GTGCGGGAAA TGTGCCGGGA
GAACAGAATC CATTGTGGAT AAGCGGAAAG GTCGGAGGAG TTTATGAATG GAATTCAGCG
ATAAGCAAGT ATCAGGACAC ATTAAGCGAA GGAAACGAAC TTATAATTGG TGATATGTTA
ACAGGAATAG GACCTAATAA ATCAGCATTT GTAAAGGTAA ACATGGCTCT TGCAATAAAT
AAAAACACAA AGCATCCTAA AGAGGCTGCA AAATTCCTGA ACTTTTTACT ATCTGATCCT
GAGGCTGTAA AAATTCTGGG ACTCAGCAGA GGAATACCAT CGAATAAAAA AGCAATAGAA
ACTCTGGATC AGGAAGGATT ATTAAAAGGG ATAGTACCGG AAGGTCTGGA AAAAGCTCTT
GCTTTTGCAG CACCAAAATC AAGTCCTTTT ATAGAAGATG AAAGAATAAG AAAAATAGGA
ATGATGTATA CACAAAAAGT AGATTACAAT GAATTAACTC CTGAACAGGC AGGGGAACAA
ATGTATGCAG AGCTTGAAAA AGTCATTGCC CAGATAAGCA AATAA
 
Protein sequence
MKKKLFLFMV LMLGLIISCG EKKDTAGDGG GEKEVVLRFS WWGGDSRHKA TLDAIKLFEE 
KNPGIKIKAE YSGWDGHFEK VSTQVTGNTA PDIMQIDWNW LYIFSKNGDG FYNFNDLKED
FDLSNYDENV LSYTTINGKV PAIPVGMNGR VFYYNKALYE KAGLSVPATA DELISSTKIL
KEKFGNDTYA LDISTTDSGV LFFLKYYVEQ KFGKSLIDSD NKMGITKEEL TEAIQFYKKL
VDEGVVLSSK DNAGAGNVPG EQNPLWISGK VGGVYEWNSA ISKYQDTLSE GNELIIGDML
TGIGPNKSAF VKVNMALAIN KNTKHPKEAA KFLNFLLSDP EAVKILGLSR GIPSNKKAIE
TLDQEGLLKG IVPEGLEKAL AFAAPKSSPF IEDERIRKIG MMYTQKVDYN ELTPEQAGEQ
MYAELEKVIA QISK