Gene SbBS512_E2489 details

Gene Information

Locus tag: SbBS512_E2489
Symbol:
ID: 6269034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 2288485
End bp: 2291562
Gene Length: 3078 bp
Protein Length: 1025 aa
Translation table: 11
GC content: 57%
IMG OID: 641726481
Product: phage tail tape measure protein
Protein accession: YP_001880961
Protein GI: 187733122
COG category: [S] Function unknown
COG ID: [COG5283] Phage-related tail protein
TIGRFAM ID: [TIGR01760] phage tail tape measure protein, TP901 family, core region
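
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the gene record.
start_bp = 2288485
end_bp = 2291562
gene_length_bp = 3078
protein_length_aa = 1025


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 3078
print(expected_protein_length(gene_length_bp))  # 1025
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.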


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGACA ATAACCTGCG TCTGCAGGTG ATTCTTAATG CGGTTGACAA ACTCACCCGC 
CCATTTCGAT CCGCGCAGGC CAGTTCAAGA GAACTGGCTG CTGCTGTCAA AAAATCCCGC
GATGCAATAA AGCAGCTTGA TCAGGCCGGG AGCAGTCTGG ACAGCTTCCG AAAGCTGCAG
GCAGAAAATC AGAAATTAGG CGACAGGCTG AACTATGCCC GCCAGCGTGC AAATTTGCTC
AGTCAGGAAC TGGGAGCGAT GGGGCCGCCT TCGCAACGTC AGGTTGTTGC TCTGGGCCGT
CAACGGCTGG CTGTTCAGCG CCTGGAAGAA CGCCAGAAAA AGCTGCAGCA GCAGACGGCG
CTTGTGCGTG CAGAACTTTA TCGTGCTGGT ATTTCAGCTA ATGATGGCGC CAGTGCGACG
GCCCGCATTA CCCGTGAAAC GATGCGTTAT AACAGGCAGC TTTCTGAGCA GGAAGCGAGG
TTACGACGTG TCGGGGAGCA ACAGCGAAAA ATGCACGCCG CCCGGGGGGC TTACGCCAGG
CGTCTTGAGG TAAGGGATCG TATTGCAGGT GCCGGAGCCA CCACCACGGC TGCAGGGCTG
GCAATGGGCG CACCAGTGAT GGCGGCAGTA AAAAGCTATA CCAGCATGGA AGATGCCATG
AAAGGTGTGG CAAAGCAGGT CAATGGTCTG CGTGACGATA ATGGCAACCG CACTGCGCGT
TTTTACGAAA TGCAGGGTGC CATCAAGGCT GCCAGCGAAC AGTTGCCGAT GGAAAACGGT
GCTGTGGACT TCGCCGCACT GGTTGAAGGT GGTGCGCGCA TGAACGTCGC AAACCCTGAC
GACAGCTGGG AAGACCAGAA ACGTGACCTG CTGGCCTTCG CCAGCACGGC GGCAAAGGCG
GCAACAGCCT TTGAACTGCC AGCGGATGAA CTGTCAGAAA GTCTGGGGAA AATCGCCCAG
CTCTACAAAA TCCCTACCCG CAATATTGAG CAGCTCGGTG ATGCGCTGAA CTATCTGGAT
GATAACGCCA TGTCGAAAGG GGCGGACATC ATTGATGTCA TGCAACGCCT GGGCGGTGTG
GCTGACCGTC TGGATTATCG TAAAGCGGCG GCGCTGGGTT CCACCTTCCT GACACTCGGC
GCTGCGCCTG AGGTTGCAGC CAGTGCAGCA AACGCGATGG TGCGTGAATT GTCCATTGCC
ACCATGCAAA GCAAGAGTTT CTTTGAAGGG ATGAATCTGC TGAAACTCAA TCCAGAAGTG
ATTGAAAAGC AGATGACGAA GGATGCGATG GGAACTATCC AGCGGGTGCT GGAGAAGGTG
AATGCGCTGC CACAGGATAA GCGCCTGTCT GCCATGACCA TGTTGTTTGG TAAAGAGTTT
GGCGATGACG CAGCGAAACT GGCAAACAAC CTGCCGGAAC TTCAGCGCCA GTTAAAACTG
ACTGCGGGCA ATGATGCGCT CGGTTCCATG CAGAAAGAAT CCGACATCAA CAAGGATTCA
CTTTCTGCTC AGTGGTTGCT GGTCAAAACC GGAGCACAGA ACACCTTCAG CAGCCTGGGG
GAAACGCTGC GCCAGCCGCT GATGGATATT CTGTACACGG TGAAAAGCGT CACGGGGGCG
TTGCGTCGCT GGGTGGAAGC TAACCCGGAA CTGACGGGCA CTCTGATGAA AGTAGCCGCT
GTGGTAGCTG CCGTTACCGT GGGCCTCGGC ACCTTAGCGG TGGCGCTGGC TGCAGTGCTG
GGGCCGCTGG CAGTCATCCG TCTGGGATTC TCTGTGCTGG GTATCAAAAC GTTACCTTCC
GTTACGGCAG CAGTAACCCG AACCAGCAAC GCGTTGTCCT GGCTGGCTGG CGCACCACTG
GCACTGCTGC GACGCGGGCT TGCTTCATCG GGCAACGCTG CAGGTTTACT TACTGCGCCG
TTGTCGTCTT TGCGCCGCAC GGCATCACTG ACGGGAAATG TCCTGAAAAC TGTAGCAGGT
GCGCCGGTTG CACTTTTGCG GTCTGGATTA TCCGGTTTAC GTGCGGTTGC TGTGATGTTT
ATGAATCCAC TGGCGGTACT ACGCGGCGGA CTGGCTGCCA CAGGCACGGT GCTGCGTATG
CTCGCATCCG GTCCACTGGC GATGCTGCGC GTTGCCCTGT ATGCCGTCTC TGGTCTGTTA
GGTGCTCTGC TCAGTCTGAT AGGTCTTGTG GTTACTGCAC TGGCGGGTGT GGCGCTGGTT
GTCTGGAAAT ACTGGCAGCC CATCACCGCA TTTCTCGGTG GCGTGGTGGA AGGATTCAAA
GCGGCGGCAG GTCCTGTCAG TGCCGCGTTC GAACCGCTTA AGCCTGTGTT CCAGTGGATT
GGCGACAAAG TACAGGCGCT GTGGGGCTGG TTTACTGATC TGCTGACGCC CGTTAAGTCG
ACCTCTGCCG AACTGCAGAG TGCAGCGGCA ATGGGGCGAC GATTCGGGGA GGCTCTGGCG
GAAGGGCTGA ATATGGTCAT GCATCCGCTG GACTCCCTGA AATCCGGCGT TTCCTGGTTG
CTGGAGAAAC TTGGCATTGT CAGTAAAGAG GCCGCAAAGG CGAAACTGCC GGAAAGCGTG
ACGCGTCAGC AACCTGCGAC GGTGAATGCA GACGGTAAAG TGATGATGCC ATCGGGTGGT
TTTCCGTCAT GGGGATATGG CTTTGCGGGG ATGTATGACA GCGGCGGGTA TATCCCGCGC
GGGCAGTTTG GCATCGTCGG TGAAAACGGG CCGGAAATTG TTAACGGCCC GGCAAATGTG
ACCAGCCGGA GAAATACAGC TGCACTGGCT GCCGTTGTTG CCGGAATGAT GGGTGTTGCT
GCTACGCCAG CAGAGCTTCC ACCGTTGCAC CCTTTGGCAC TTCCCGCGAA AGGCGGTGAA
GCGATGGTGA GTCGTGCAGC CACTGTGCCG CCCGTTCAAC GGATTGAGGC ACCGATGCAG
ATCATCATTC AAACGCAGCC AGGACAAAGT GCGCAGGATA TTGCGCGGGA GGTGGCCCGC
CAGCTTGATG AACGTGAACG CAGGCTGAAG GCAAAAGCCA GGAGTAACTA CAGCGATCAG
GGGGGATACG ACGCATGA
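
The gene sequence is listed in space-separated blocks of 10 bases, so it has to be joined before any computation such as GC content. The following is a minimal sketch, assuming the rows are pasted in as plain text; `clean_sequence` and `gc_percent` are hypothetical helper names. Only the first row is used here, so the result for that row is not expected to match the 57% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked listing into one uppercase string without whitespace."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the listing.
first_row = "ATGAGTGACA ATAACCTGCG TCTGCAGGTG ATTCTTAATG CGGTTGACAA ACTCACCCGC"

print(len(clean_sequence(first_row)))  # 60 bases per full row
print(gc_percent("GGCC AATT"))         # 50.0 for this toy input
```

To compare against the GC content field, pass all rows of the listing joined into one string to `gc_percent`.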
 
Protein sequence
MSDNNLRLQV ILNAVDKLTR PFRSAQASSR ELAAAVKKSR DAIKQLDQAG SSLDSFRKLQ 
AENQKLGDRL NYARQRANLL SQELGAMGPP SQRQVVALGR QRLAVQRLEE RQKKLQQQTA
LVRAELYRAG ISANDGASAT ARITRETMRY NRQLSEQEAR LRRVGEQQRK MHAARGAYAR
RLEVRDRIAG AGATTTAAGL AMGAPVMAAV KSYTSMEDAM KGVAKQVNGL RDDNGNRTAR
FYEMQGAIKA ASEQLPMENG AVDFAALVEG GARMNVANPD DSWEDQKRDL LAFASTAAKA
ATAFELPADE LSESLGKIAQ LYKIPTRNIE QLGDALNYLD DNAMSKGADI IDVMQRLGGV
ADRLDYRKAA ALGSTFLTLG AAPEVAASAA NAMVRELSIA TMQSKSFFEG MNLLKLNPEV
IEKQMTKDAM GTIQRVLEKV NALPQDKRLS AMTMLFGKEF GDDAAKLANN LPELQRQLKL
TAGNDALGSM QKESDINKDS LSAQWLLVKT GAQNTFSSLG ETLRQPLMDI LYTVKSVTGA
LRRWVEANPE LTGTLMKVAA VVAAVTVGLG TLAVALAAVL GPLAVIRLGF SVLGIKTLPS
VTAAVTRTSN ALSWLAGAPL ALLRRGLASS GNAAGLLTAP LSSLRRTASL TGNVLKTVAG
APVALLRSGL SGLRAVAVMF MNPLAVLRGG LAATGTVLRM LASGPLAMLR VALYAVSGLL
GALLSLIGLV VTALAGVALV VWKYWQPITA FLGGVVEGFK AAAGPVSAAF EPLKPVFQWI
GDKVQALWGW FTDLLTPVKS TSAELQSAAA MGRRFGEALA EGLNMVMHPL DSLKSGVSWL
LEKLGIVSKE AAKAKLPESV TRQQPATVNA DGKVMMPSGG FPSWGYGFAG MYDSGGYIPR
GQFGIVGENG PEIVNGPANV TSRRNTAALA AVVAGMMGVA ATPAELPPLH PLALPAKGGE
AMVSRAATVP PVQRIEAPMQ IIIQTQPGQS AQDIAREVAR QLDERERRLK AKARSNYSDQ
GGYDA
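
The relationship between the two listings can be spot-checked by translating the first and last rows of the gene sequence. This is a sketch, not the database's own procedure: it builds the standard codon table by hand (translation table 11 uses the same amino-acid assignments and differs in the start codons it permits), and `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks and the final row of the gene sequence.
print(translate("ATGAGTGACA ATAACCTGCG"))  # MSDNNL
print(translate("GGGGGATACG ACGCATGA"))    # GGYDA
```

The two outputs match the first six and last five residues of the protein sequence. The final row can be translated on its own because the 51 rows before it hold 3060 bases, a multiple of 3, so it starts in frame.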