Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2489 |
Symbol | |
ID | 6269034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2288485 |
End bp | 2291562 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641726481 |
Product | phage tail tape measure protein |
Protein accession | YP_001880961 |
Protein GI | 187733122 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGACA ATAACCTGCG TCTGCAGGTG ATTCTTAATG CGGTTGACAA ACTCACCCGC CCATTTCGAT CCGCGCAGGC CAGTTCAAGA GAACTGGCTG CTGCTGTCAA AAAATCCCGC GATGCAATAA AGCAGCTTGA TCAGGCCGGG AGCAGTCTGG ACAGCTTCCG AAAGCTGCAG GCAGAAAATC AGAAATTAGG CGACAGGCTG AACTATGCCC GCCAGCGTGC AAATTTGCTC AGTCAGGAAC TGGGAGCGAT GGGGCCGCCT TCGCAACGTC AGGTTGTTGC TCTGGGCCGT CAACGGCTGG CTGTTCAGCG CCTGGAAGAA CGCCAGAAAA AGCTGCAGCA GCAGACGGCG CTTGTGCGTG CAGAACTTTA TCGTGCTGGT ATTTCAGCTA ATGATGGCGC CAGTGCGACG GCCCGCATTA CCCGTGAAAC GATGCGTTAT AACAGGCAGC TTTCTGAGCA GGAAGCGAGG TTACGACGTG TCGGGGAGCA ACAGCGAAAA ATGCACGCCG CCCGGGGGGC TTACGCCAGG CGTCTTGAGG TAAGGGATCG TATTGCAGGT GCCGGAGCCA CCACCACGGC TGCAGGGCTG GCAATGGGCG CACCAGTGAT GGCGGCAGTA AAAAGCTATA CCAGCATGGA AGATGCCATG AAAGGTGTGG CAAAGCAGGT CAATGGTCTG CGTGACGATA ATGGCAACCG CACTGCGCGT TTTTACGAAA TGCAGGGTGC CATCAAGGCT GCCAGCGAAC AGTTGCCGAT GGAAAACGGT GCTGTGGACT TCGCCGCACT GGTTGAAGGT GGTGCGCGCA TGAACGTCGC AAACCCTGAC GACAGCTGGG AAGACCAGAA ACGTGACCTG CTGGCCTTCG CCAGCACGGC GGCAAAGGCG GCAACAGCCT TTGAACTGCC AGCGGATGAA CTGTCAGAAA GTCTGGGGAA AATCGCCCAG CTCTACAAAA TCCCTACCCG CAATATTGAG CAGCTCGGTG ATGCGCTGAA CTATCTGGAT GATAACGCCA TGTCGAAAGG GGCGGACATC ATTGATGTCA TGCAACGCCT GGGCGGTGTG GCTGACCGTC TGGATTATCG TAAAGCGGCG GCGCTGGGTT CCACCTTCCT GACACTCGGC GCTGCGCCTG AGGTTGCAGC CAGTGCAGCA AACGCGATGG TGCGTGAATT GTCCATTGCC ACCATGCAAA GCAAGAGTTT CTTTGAAGGG ATGAATCTGC TGAAACTCAA TCCAGAAGTG ATTGAAAAGC AGATGACGAA GGATGCGATG GGAACTATCC AGCGGGTGCT GGAGAAGGTG AATGCGCTGC CACAGGATAA GCGCCTGTCT GCCATGACCA TGTTGTTTGG TAAAGAGTTT GGCGATGACG CAGCGAAACT GGCAAACAAC CTGCCGGAAC TTCAGCGCCA GTTAAAACTG ACTGCGGGCA ATGATGCGCT CGGTTCCATG CAGAAAGAAT CCGACATCAA CAAGGATTCA CTTTCTGCTC AGTGGTTGCT GGTCAAAACC GGAGCACAGA ACACCTTCAG CAGCCTGGGG GAAACGCTGC GCCAGCCGCT GATGGATATT CTGTACACGG TGAAAAGCGT CACGGGGGCG TTGCGTCGCT GGGTGGAAGC TAACCCGGAA CTGACGGGCA CTCTGATGAA AGTAGCCGCT GTGGTAGCTG CCGTTACCGT GGGCCTCGGC ACCTTAGCGG TGGCGCTGGC TGCAGTGCTG GGGCCGCTGG CAGTCATCCG TCTGGGATTC TCTGTGCTGG GTATCAAAAC GTTACCTTCC GTTACGGCAG CAGTAACCCG AACCAGCAAC GCGTTGTCCT GGCTGGCTGG CGCACCACTG GCACTGCTGC GACGCGGGCT TGCTTCATCG GGCAACGCTG CAGGTTTACT TACTGCGCCG TTGTCGTCTT TGCGCCGCAC GGCATCACTG ACGGGAAATG TCCTGAAAAC TGTAGCAGGT GCGCCGGTTG CACTTTTGCG GTCTGGATTA TCCGGTTTAC GTGCGGTTGC TGTGATGTTT ATGAATCCAC TGGCGGTACT ACGCGGCGGA CTGGCTGCCA CAGGCACGGT GCTGCGTATG CTCGCATCCG GTCCACTGGC GATGCTGCGC GTTGCCCTGT ATGCCGTCTC TGGTCTGTTA GGTGCTCTGC TCAGTCTGAT AGGTCTTGTG GTTACTGCAC TGGCGGGTGT GGCGCTGGTT GTCTGGAAAT ACTGGCAGCC CATCACCGCA TTTCTCGGTG GCGTGGTGGA AGGATTCAAA GCGGCGGCAG GTCCTGTCAG TGCCGCGTTC GAACCGCTTA AGCCTGTGTT CCAGTGGATT GGCGACAAAG TACAGGCGCT GTGGGGCTGG TTTACTGATC TGCTGACGCC CGTTAAGTCG ACCTCTGCCG AACTGCAGAG TGCAGCGGCA ATGGGGCGAC GATTCGGGGA GGCTCTGGCG GAAGGGCTGA ATATGGTCAT GCATCCGCTG GACTCCCTGA AATCCGGCGT TTCCTGGTTG CTGGAGAAAC TTGGCATTGT CAGTAAAGAG GCCGCAAAGG CGAAACTGCC GGAAAGCGTG ACGCGTCAGC AACCTGCGAC GGTGAATGCA GACGGTAAAG TGATGATGCC ATCGGGTGGT TTTCCGTCAT GGGGATATGG CTTTGCGGGG ATGTATGACA GCGGCGGGTA TATCCCGCGC GGGCAGTTTG GCATCGTCGG TGAAAACGGG CCGGAAATTG TTAACGGCCC GGCAAATGTG ACCAGCCGGA GAAATACAGC TGCACTGGCT GCCGTTGTTG CCGGAATGAT GGGTGTTGCT GCTACGCCAG CAGAGCTTCC ACCGTTGCAC CCTTTGGCAC TTCCCGCGAA AGGCGGTGAA GCGATGGTGA GTCGTGCAGC CACTGTGCCG CCCGTTCAAC GGATTGAGGC ACCGATGCAG ATCATCATTC AAACGCAGCC AGGACAAAGT GCGCAGGATA TTGCGCGGGA GGTGGCCCGC CAGCTTGATG AACGTGAACG CAGGCTGAAG GCAAAAGCCA GGAGTAACTA CAGCGATCAG GGGGGATACG ACGCATGA
|
Protein sequence | MSDNNLRLQV ILNAVDKLTR PFRSAQASSR ELAAAVKKSR DAIKQLDQAG SSLDSFRKLQ AENQKLGDRL NYARQRANLL SQELGAMGPP SQRQVVALGR QRLAVQRLEE RQKKLQQQTA LVRAELYRAG ISANDGASAT ARITRETMRY NRQLSEQEAR LRRVGEQQRK MHAARGAYAR RLEVRDRIAG AGATTTAAGL AMGAPVMAAV KSYTSMEDAM KGVAKQVNGL RDDNGNRTAR FYEMQGAIKA ASEQLPMENG AVDFAALVEG GARMNVANPD DSWEDQKRDL LAFASTAAKA ATAFELPADE LSESLGKIAQ LYKIPTRNIE QLGDALNYLD DNAMSKGADI IDVMQRLGGV ADRLDYRKAA ALGSTFLTLG AAPEVAASAA NAMVRELSIA TMQSKSFFEG MNLLKLNPEV IEKQMTKDAM GTIQRVLEKV NALPQDKRLS AMTMLFGKEF GDDAAKLANN LPELQRQLKL TAGNDALGSM QKESDINKDS LSAQWLLVKT GAQNTFSSLG ETLRQPLMDI LYTVKSVTGA LRRWVEANPE LTGTLMKVAA VVAAVTVGLG TLAVALAAVL GPLAVIRLGF SVLGIKTLPS VTAAVTRTSN ALSWLAGAPL ALLRRGLASS GNAAGLLTAP LSSLRRTASL TGNVLKTVAG APVALLRSGL SGLRAVAVMF MNPLAVLRGG LAATGTVLRM LASGPLAMLR VALYAVSGLL GALLSLIGLV VTALAGVALV VWKYWQPITA FLGGVVEGFK AAAGPVSAAF EPLKPVFQWI GDKVQALWGW FTDLLTPVKS TSAELQSAAA MGRRFGEALA EGLNMVMHPL DSLKSGVSWL LEKLGIVSKE AAKAKLPESV TRQQPATVNA DGKVMMPSGG FPSWGYGFAG MYDSGGYIPR GQFGIVGENG PEIVNGPANV TSRRNTAALA AVVAGMMGVA ATPAELPPLH PLALPAKGGE AMVSRAATVP PVQRIEAPMQ IIIQTQPGQS AQDIAREVAR QLDERERRLK AKARSNYSDQ GGYDA
|
| |