Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_4040 |
Symbol | |
ID | 8599484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | - |
Start bp | 4298611 |
End bp | 4299915 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003310803 |
Protein GI | 269122626 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AATTATTTTT ATTCATGGTA TTAATGTTAG GACTGATAAT CAGCTGTGGT GAAAAGAAAG ATACAGCAGG AGACGGCGGC GGAGAAAAAG AAGTTGTACT GAGATTTTCA TGGTGGGGCG GAGATTCGAG ACATAAAGCA ACACTTGATG CAATAAAATT ATTCGAGGAA AAAAATCCCG GAATAAAGAT AAAAGCTGAG TATTCGGGAT GGGACGGGCA TTTTGAGAAA GTATCAACAC AGGTTACGGG AAATACTGCA CCTGATATTA TGCAGATAGA CTGGAACTGG CTTTATATTT TTTCAAAAAA CGGTGATGGT TTTTATAATT TCAATGATTT GAAAGAAGAC TTTGATCTTT CAAATTATGA TGAAAATGTT TTGAGCTACA CTACAATTAA CGGAAAAGTA CCTGCTATTC CTGTAGGAAT GAACGGAAGG GTATTTTATT ACAATAAAGC ACTATATGAA AAAGCCGGGC TGTCGGTACC TGCAACAGCA GATGAATTGA TTTCTTCTAC AAAAATCTTA AAAGAAAAAT TCGGCAATGA TACATATGCT CTGGATATTT CAACAACTGA CAGCGGAGTA TTGTTCTTTT TGAAATATTA TGTAGAGCAG AAATTCGGAA AATCCCTGAT AGACAGTGAT AATAAAATGG GAATTACAAA AGAGGAATTA ACAGAAGCAA TACAGTTTTA TAAAAAGCTC GTGGATGAAG GTGTAGTACT ATCAAGTAAA GATAACGCAG GTGCGGGAAA TGTGCCGGGA GAACAGAATC CATTGTGGAT AAGCGGAAAG GTCGGAGGAG TTTATGAATG GAATTCAGCG ATAAGCAAGT ATCAGGACAC ATTAAGCGAA GGAAACGAAC TTATAATTGG TGATATGTTA ACAGGAATAG GACCTAATAA ATCAGCATTT GTAAAGGTAA ACATGGCTCT TGCAATAAAT AAAAACACAA AGCATCCTAA AGAGGCTGCA AAATTCCTGA ACTTTTTACT ATCTGATCCT GAGGCTGTAA AAATTCTGGG ACTCAGCAGA GGAATACCAT CGAATAAAAA AGCAATAGAA ACTCTGGATC AGGAAGGATT ATTAAAAGGG ATAGTACCGG AAGGTCTGGA AAAAGCTCTT GCTTTTGCAG CACCAAAATC AAGTCCTTTT ATAGAAGATG AAAGAATAAG AAAAATAGGA ATGATGTATA CACAAAAAGT AGATTACAAT GAATTAACTC CTGAACAGGC AGGGGAACAA ATGTATGCAG AGCTTGAAAA AGTCATTGCC CAGATAAGCA AATAA
|
Protein sequence | MKKKLFLFMV LMLGLIISCG EKKDTAGDGG GEKEVVLRFS WWGGDSRHKA TLDAIKLFEE KNPGIKIKAE YSGWDGHFEK VSTQVTGNTA PDIMQIDWNW LYIFSKNGDG FYNFNDLKED FDLSNYDENV LSYTTINGKV PAIPVGMNGR VFYYNKALYE KAGLSVPATA DELISSTKIL KEKFGNDTYA LDISTTDSGV LFFLKYYVEQ KFGKSLIDSD NKMGITKEEL TEAIQFYKKL VDEGVVLSSK DNAGAGNVPG EQNPLWISGK VGGVYEWNSA ISKYQDTLSE GNELIIGDML TGIGPNKSAF VKVNMALAIN KNTKHPKEAA KFLNFLLSDP EAVKILGLSR GIPSNKKAIE TLDQEGLLKG IVPEGLEKAL AFAAPKSSPF IEDERIRKIG MMYTQKVDYN ELTPEQAGEQ MYAELEKVIA QISK
|
| |