Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2101 |
Symbol | |
ID | 6269474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1909078 |
End bp | 1910361 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641726138 |
Product | integral membrane protein, PqiA family |
Protein accession | YP_001880632 |
Protein GI | 187731021 |
COG category | [S] Function unknown |
COG ID | [COG2995] Uncharacterized paraquat-inducible protein A |
TIGRFAM ID | [TIGR00155] integral membrane protein, PqiA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000000268144 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCTTA ACACACCACA AATTACGCCG ACAAAAAAGA TAACAGTGAG GTCAATCGGC GAGGAACTGC CGCGTGGTGA TTACCAACGT TGCCCGCAAT GTGACATGCT GTTTAGCCTG CCCGAGATAA ATTCTCATCA GAGTGCCTAT TGTCCGCGCT GTCAGGCAAA AATTCGCGAC GGGCGCGACT GGTCGCTAAC GCGCCTGGCG GCAATGGCTT TCACCATGCT GTTGCTGATG CCGTTTGCCT GGGGCGAACC GCTGTTGCAT ATCTGGCTGT TAGGTATTCG TATCGACGCA AACGTCATGC AAGGCATCTG GCAAATGACC AAACAGGGCG ATACGATAAC GGGGGCGATG GTCTTTTTCT GCGTTATTGG TGCCCCCCTC ATTCTGGTGT CCTCCATAGC ATATTTATGG TTTGGTAACC GACTGGGAAT GAATTTACGT CCGGTACTGC TGATGCTTGA GCGTCTTAAA GAGTGGGTAA TGCTGGATAT CTACCTGGTC GGCATTGGCG TTGCTTCTAT AAAGGTACAG GATTATGCCC ATATCCAGGC GGGTGTCGGC TTGTTCTCTT TTGTGGCGTT GGTGATTTTA ACGACGGTGA CGTTGTCACA TCTTAATGTT GAGGAGCTGT GGGAGCGATA TTATCCGCAG CGCCCCGCTA CGCGTAGGGA CGAGAAACTT CGTGTCTGTC TTGGGTGCCA TTTTACCGGC TATCCTGATC AGCGTGGTCG CTGCCCGCGT TGCCATATCC CGCTACGCCT GCGTCGTCGT CATAGTTTGC AAAAATGCTG GGCGGCGCTG TTAGCGTCAA TCGTTTTGTT GTTACCAGCC AACCTATTGC CTATTTCTAT CATTTATCTG AATGGTGGCC GGCAGGAAGA TACGATTCTT TCCGGGATTA TGTCGCTGGC AAGTAGCAAC ATTGCGGTCG CCGGAATCGT GTTTATCGCC AGTATTCTGG TACCGTTTAC TAAAGTGATC GTCATGTTCA CTTTACTGTT GAGCACTCAT TTTAAATGCC AGCAAGGTTT ACGCACACGC ATTCTGTTAC TGCGGATGGT GACCTGGATT GGACGCTGGT CGATGCTCGA CCTGTTTGTC ATATCTTTAA CCATGTCGCT GATTAATCGC GATCAGATCC TCGCTTTTAC TATGGGACCG GCTGCGTTTT ATTTTGGCGC AGCCGTAATT TTGACTATTC TTGCTGTGGA ATGGCTGGAC AGCCGCTTAC TTTGGGATGC ACATGAGTCA GGAAACGCCC GCTTCGACGA CTGA
|
Protein sequence | MALNTPQITP TKKITVRSIG EELPRGDYQR CPQCDMLFSL PEINSHQSAY CPRCQAKIRD GRDWSLTRLA AMAFTMLLLM PFAWGEPLLH IWLLGIRIDA NVMQGIWQMT KQGDTITGAM VFFCVIGAPL ILVSSIAYLW FGNRLGMNLR PVLLMLERLK EWVMLDIYLV GIGVASIKVQ DYAHIQAGVG LFSFVALVIL TTVTLSHLNV EELWERYYPQ RPATRRDEKL RVCLGCHFTG YPDQRGRCPR CHIPLRLRRR HSLQKCWAAL LASIVLLLPA NLLPISIIYL NGGRQEDTIL SGIMSLASSN IAVAGIVFIA SILVPFTKVI VMFTLLLSTH FKCQQGLRTR ILLLRMVTWI GRWSMLDLFV ISLTMSLINR DQILAFTMGP AAFYFGAAVI LTILAVEWLD SRLLWDAHES GNARFDD
|
| |