Gene SbBS512_E1288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1288 
Symbol 
ID6268375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1177315 
End bp1178907 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content56% 
IMG OID641725409 
Productphage portal protein, lambda family 
Protein accessionYP_001879920 
Protein GI187730612 
COG category[R] General function prediction only 
COG ID[COG5511] Bacteriophage capsid protein 
TIGRFAM ID[TIGR01539] phage portal protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGAA CGCCTGTCCT GATTGATGTG AACGGCGTTC CGCTTCGTGA GAGTCTCAGC 
TACAACGGGG GCGGCGCAGG ATTTGGTGGG CAAATGGCGG AGTGGTTGCC ACCGGCGCAG
AGTGCCGATG CAGCCCTGCT GCCTGCGTTG CGTCTGGGGA ATGCCCGGGC AGATGATCTG
GTGCGCAATA ACGGGATAGC GGCCAATGCG GTGGCCCTGC ATAAGGATCA TATTGTCGGG
CATATGTTTC TGATCAGCTA CCGTCCGAAC TGGCGCTGGC TGGGGATGCG GGAGACTGCG
GCAAAAAGTT TTGTCGATGA GGTGGAGGCG GCCTGGTCGG AATACGCCGA AGGGATGTTT
GGCGAGATCG ACGTGGAAGA GAAACGCACG TTTACGGAAT TTATTCGTGA AGGTGTGGGC
GTTCATGCGT TTAACGGCGA AATCTTTGTG CAGCCGGTCT GGGATACGGA GAGCACGCAA
CTGTTTCGTA CGCGTTTTAA AGCCGTGAGT CCGAAACGGG TGGACACGCC AGGACACGGT
ATGGGGAACC GTTTTCTGCG GGCCGGTGTG GAGGTCGATC GATATGGTCG TGCCGTTGCG
TACCATATCT GTGAGGATGA TTTTCCGTTC TCTGGGAGTG GACGATGGGA ACGGATCCCG
CGTGAACTTC CCACCGGGCG TCCGGCCATG CTGCATATTT TCGAGCCGGT GGAGGACGGG
CAGACCCGTG GGGCCAATCA GTTTTACAGC GTAATGGAAC GGCTGAAGAT GCTCGATTCC
CTGCAGGCAA CACAGCTTCA GTCGGCCATA GTGAAGGCGA TGTATGCAGC GACGATTGAA
AGTGACCTTG ATACCGAAAA GGCCTTTGAA TATATCGCCG GTGCGCCGCA GGGGCAGAAG
GATAATCCGC TTATTAATAT TCTGGATAAG TTCTCCACCT GGTATGACAC GAATAGCGTG
ACGCTGGGCG GTGTCAAAAT TCCGCACCTT TTCCCCGGTG ATGATCTGAA ACTTCAGACC
GCGCAGGATT CAGACAATGG ATTTTCGGCG CTTGAACAGG CGCTGCTGCG GTATATCGCC
GCCGGTCTTG GCGTTTCCTA CGAACAGTTG TCCCGTGATT ACTCGAAGGT CAGTTATTCA
AGTGCCCGCG CATCCGCCAA TGAGTCGTGG CGCTATTTTA TGGGGCGGCG AAAATTTATT
GCGTCCCGGC TGGCCACGCA GATGTTTTCC TGCTGGCTGG AAGAGGCACT TCTTCGGGGG
ATTATTCGTC CGCCACGGGC ACGTTTTGAT TTTTATCAGG CGCGATCAGC CTGGTCACGG
GCTGAGTGGA TTGGAGCCGG AAGAATGGCC ATTGACGGGC TCAAGGAGGT TCAGGAATCA
GTGATGCGCA TTGAGGCCGG ACTGAGCACG TATGAGAAAG AGCTGGCGCT GATGGGCGAG
GATTATCAGG ACATTTTCCG CCAGCAGGTC AGGGAATCTG CAGAGCGGGA AAAAGCCGGA
CTCTCACGTC CGGTGTGGAT AGCGCAGGCG TATCAGCAGC AGATAGCGGA GAGTCGCAGG
CCGGAAGAGG AGACAACACC ACGTGAGACG TAA
 
Protein sequence
MKRTPVLIDV NGVPLRESLS YNGGGAGFGG QMAEWLPPAQ SADAALLPAL RLGNARADDL 
VRNNGIAANA VALHKDHIVG HMFLISYRPN WRWLGMRETA AKSFVDEVEA AWSEYAEGMF
GEIDVEEKRT FTEFIREGVG VHAFNGEIFV QPVWDTESTQ LFRTRFKAVS PKRVDTPGHG
MGNRFLRAGV EVDRYGRAVA YHICEDDFPF SGSGRWERIP RELPTGRPAM LHIFEPVEDG
QTRGANQFYS VMERLKMLDS LQATQLQSAI VKAMYAATIE SDLDTEKAFE YIAGAPQGQK
DNPLINILDK FSTWYDTNSV TLGGVKIPHL FPGDDLKLQT AQDSDNGFSA LEQALLRYIA
AGLGVSYEQL SRDYSKVSYS SARASANESW RYFMGRRKFI ASRLATQMFS CWLEEALLRG
IIRPPRARFD FYQARSAWSR AEWIGAGRMA IDGLKEVQES VMRIEAGLST YEKELALMGE
DYQDIFRQQV RESAEREKAG LSRPVWIAQA YQQQIAESRR PEEETTPRET