Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4766 |
Symbol | |
ID | 6273260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4447266 |
End bp | 4448270 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641728520 |
Product | hypothetical protein |
Protein accession | YP_001882915 |
Protein GI | 187731231 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 63 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACT TCACGACCAG CACGCCACAT GACGCATTAT TTAAATCCTT TCTCACTCAC CCTGACACCG CGCGGGATTT TATGGAGATC CACATACCCA AAGATTTACG TGAACTGTGC GATCTCGACA GCTTAAAACT GGAATCCGCC AGCTTTGTCG ATGAAAAATT GCGGGTGCTA CACTCCGATA TTCTGTGGTC GGTAAAGACC CGTGAAGGTG ATGGTTATAT TTACGTAGTG ATTGAACATC AGAGCCGCGA GGATATTCAT ATGGCATTTC GCCTGATGCG ATATTCCATG GCGGTGATGC AGCGCCATAT CGAGCATGAT AAACGCCGGC CGCTACCGCT GGTTATCCCG ATGCTGTTTT ATCACGGTAG CCGTAGTCCT TACCCCTGGT CCCTGTGCTG GCTGGACGAA TTTGCCGCCC CGACTACCGC ACGGAAACTT TATAGCGCAG CGTTCCCGCT GGTGGATGTC ACTGTCGTGC CAGACGACGA GATTGTGCAG CATCGCAGAG TCGCCCTGTT GGAGTTGATC CAAAAGCATA TTCGCCAGCG CGATTTGATG GGGCTTATTG ACCAACTGGT AGTATTACTG GTTACAGAGT GTGCTAATGA CAGCCAGATA ACTGCGCTGT TAAATTACAT TTTACTGACT GGCGATGAAG CGCGTTTTAA GGCGTTTATC AGCGAACTTA CCAGGCGAAT GCCACACCAC AGGGAGCGAA TAATGACAAT TGCAGAGCGA ATTCATAATG ATGGATGGCT GTTGGGAAGG GAGAGGGGGA GGAAAGAAGG GAAAGTAGAA GGGGAACGGA GCCTCCTCCG ATTGTTGTTG CAGAATGGGG CCGATCCTGA ATGGATACAA CGATATACCG GACTTTCGGC AGAGCAAATG CAGGCATTAG ATCTGAAGTG GCACACTGAA TTTGGCCACC TGAACAGAGG TGATATGCTC ACCTCAGAAC AACACAGGTG CTCCAATGAA AAAAAGAAAT TTTAG
|
Protein sequence | MTNFTTSTPH DALFKSFLTH PDTARDFMEI HIPKDLRELC DLDSLKLESA SFVDEKLRVL HSDILWSVKT REGDGYIYVV IEHQSREDIH MAFRLMRYSM AVMQRHIEHD KRRPLPLVIP MLFYHGSRSP YPWSLCWLDE FAAPTTARKL YSAAFPLVDV TVVPDDEIVQ HRRVALLELI QKHIRQRDLM GLIDQLVVLL VTECANDSQI TALLNYILLT GDEARFKAFI SELTRRMPHH RERIMTIAER IHNDGWLLGR ERGRKEGKVE GERSLLRLLL QNGADPEWIQ RYTGLSAEQM QALDLKWHTE FGHLNRGDML TSEQHRCSNE KKKF
|
| |