Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3848 |
Symbol | |
ID | 6271175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3574473 |
End bp | 3575522 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641727704 |
Product | hypothetical protein |
Protein accession | YP_001882139 |
Protein GI | 187730420 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACCC CTCAACCCGA TAAAACGGGC ATGCACATTC TGCTCAAGTT GGCCTCGCTG GTAGTGATCC TCGCGGGCAT TCACGCAGCG GCAGATATCA TTGTGCAGCT GTTACTGGCG CTGTTTTTTG CCATCGTCCT CAACCCGCTC GTTACCTGGT TTATTCGTCG GGGAGTACAA CGCCCCGTTG CCATTACGAT TGTAGTGGTG GTGATGCTGA TCGCACTAAC CGCGCTGGTC GGCGTACTGG CGGCATCGTT TAACGAATTT ATCTCTATGC TGCCGAAGTT TAATAAGGAG CTGACGCGCA AACTTTTTAA ATTGCAGGAG ATGTTGCCTT TTCTTAATTT GCATATGTCG CCGGAGCGAA TGCTGCAGCG GATGGACTCG GAAAAAGTGG TTACCTTCAC CACAGCGCTA ATGACCGGGC TTTCCGGGGC AATGGCGAGC GTGCTTTTGC TGGTGATGAC CGTAGTTTTT ATGCTGTTTG AAGTGCGCCA CGTCCCTTAC AAAATGCGTT TTGCGCTGAA TAACCCACAG ATTCACATCG CAGGATTACA CCGCGCACTT AAAGGCGTTT CGCATTATCT GGCGTTAAAG ACGCTACTCA GTTTATGGAC AGGTGTCATC GTCTGGCTGG GGCTGGCGCT AATGGGCGTA CAGTTTGCGC TGATGTGGGC AGTACTGGCG TTTTTGCTCA ACTACGTGCC CAATATCGGC GCGGTAATTT CCGCCGTACC GCCAATGATT CAGGTGCTGC TGTTTAATGG TGTTTACGAA TGTATTCTGG TCGGCGCATT GTTTTTAGTG GTCCATATGG TCATCGGCAA TATTTTAGAA CCACGGATGA TGGGCCATCG CCTGGGGATG TCCACCATGG TGGTATTTCT TTCATTGTTA ATTTGGGGAT GGCTGCTCGG CCCGGTAGGG ATGCTACTTT CGGTGCCATT AACCAGCGTG TGTAAAATCT GGATGGAAAC CACCAAAGGC GGTAGCAAAC TGGCGATTTT ACTGGGGCCA GGCAGACCGA AAAGTCGGTT ACCGGGATGA
|
Protein sequence | METPQPDKTG MHILLKLASL VVILAGIHAA ADIIVQLLLA LFFAIVLNPL VTWFIRRGVQ RPVAITIVVV VMLIALTALV GVLAASFNEF ISMLPKFNKE LTRKLFKLQE MLPFLNLHMS PERMLQRMDS EKVVTFTTAL MTGLSGAMAS VLLLVMTVVF MLFEVRHVPY KMRFALNNPQ IHIAGLHRAL KGVSHYLALK TLLSLWTGVI VWLGLALMGV QFALMWAVLA FLLNYVPNIG AVISAVPPMI QVLLFNGVYE CILVGALFLV VHMVIGNILE PRMMGHRLGM STMVVFLSLL IWGWLLGPVG MLLSVPLTSV CKIWMETTKG GSKLAILLGP GRPKSRLPG
|
| |