Gene SbBS512_E1456 details

Gene Information

Locus tag: SbBS512_E1456
Symbol:
ID: 6272127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1329345
End bp: 1330937
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 56%
IMG OID: 641725557
Product: phage portal protein, lambda family
Protein accession: YP_001880063
Protein GI: 187731680
COG category: [R] General function prediction only
COG ID: [COG5511] Bacteriophage capsid protein
TIGRFAM ID: [TIGR01539] phage portal protein, lambda family
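
The coordinate and length fields can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1329345
end_bp = 1330937
gene_length_bp = 1593
protein_length_aa = 530

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1
codons, remainder = divmod(span, 3)

print(span == gene_length_bp)            # span agrees with the Gene Length field
print(remainder == 0)                    # the CDS is a whole number of codons
print(codons - 1 == protein_length_aa)   # one codon is assumed to be the stop
```

Under these assumptions all three checks hold: 1593 bp is 531 codons, and dropping the stop codon leaves 530 residues.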


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGAA CGCCTGTCCT GATTGATGTG AACGGCGTTC CGCTTCGGGA GAGCCTCAGC 
TACACCGGTG GCGGTGCAGG ATTTGGCGGG CAAATGGCAG AGTGGTTGCC ACCCTCGCAG
AGTGCCGATG CGGCCCTGCT GCCCGCGTTG CGTCTGGGGA ATGCCCGTGC AGATGATCTG
GTGCGCAATA ACGGAATAGC GGCCAATGCA GTGGCCCTGC ATAAGGATCA CATTGTCGGG
CATATGTTTC TGATTAGCTA CCGTCCGAAC TGGCGCTGGC TGGGGATGCG GGAGACCGCG
GCAAAAAGTT TTGTCGATGA GGTGGAGGCG GCCTGGTCAG AATACGCAGA AGGGATGTTT
GGTGAGATCG ACGTGGAAGG GAAACGCACG TTTACGGAAT TTATCCGTGA AGGTGTGGGC
GTTCATGCGT TTAACGGCGA AATCTTTGTG CAGCCGGTCT GGGATACGGA GAGTACGCAA
CTGTTTCGTA CGCGTTTTAA AGCCGTGAGT CCGAAACGGG TGGACACGCC AGGACACGGT
ATCGGGAACC GTTTTCTGCG GGCCGGTGTG GAGGTTGATC GATATGGCCG TGCCGTTGCG
TACCATATCT GTGAGGATGA TTTTCCTCGC TCCGGGAGTG GACGATGGGA ACGGATCCCG
CGTGAACTAC CCACCGGGCG TCCGGCCATG CTGCATATTT TCGAGCCGGT GGAGGACGGG
CAGACCCGTG GAGCCAATCA GTTTTACAGC GTTATGGAAC GGCTGAAGAT GCTGGATTCC
CTGCAGGCAA CACAGCTTCA GTCGGCCATA GTGAAGGCGA TGTATGCAGC GACGATTGAA
AGTGACCTTG ATACCGAAAA GGCCTTTGAA TATATCGCCG GTGCGCCGCA GGGGCAGAAG
GATAATCCGC TTATTAATAT TCTGGATAAG TTCTCCACCT GGTATGACAC GAATAGCGTG
ACGCTGGGCG GTGTCAAAAT TCCGCACCTT TTCCCCGGTG ATGATCTGAA ACTTCAGACC
GCGCAGGATT CAGACAATGG ATTTTCGGCG CTTGAACAGG CGCTGCTGCG GTATATCGCC
GCCGGTCTTG GCGTTTCCTA CGAACAGTTG TCCCGTGATT ACTCGAAGGT CAGTTACTCA
AGTGCCCGCG CATCCGCCAA TGAGTCGTGG CGCTATTTTA TGGGGCGGCG AAAATTTATT
GCGTCCCGGC TGGCCACGCA GATGTTTTCC TGCTGGCTGG AAGAGGCACT TCTTCGGGGG
ATTATTCGTC CGCCACGGGC ACGGTTTGAT TTTTATCAGG CGCGATCAGC CTGGTCACGG
GCTGAGTGGA TTGGAGCCGG AAGAATGGCC ATTGACGGGC TCAAGGAGGT TCAGGAATCA
GTGATGCGCA TTGAGGCCGG ACTGAGCACG TATGAGAAAG AGCTGGCGCT GATGGGCGAG
GATTATCAGG ACATTTTCCG CCAGCAGGTC AGGGAATCTG CAGAGCGGGA AAAAGCCGGA
CTCTCACGTC CGGTGTGGAT AGCGCAGGCG TATCAGCAGC AGATAGCGGA GAGTCGCAGG
CCGGAAGAGG AGACAACACC ACGTGAGACG TAA
 
Protein sequence
MKRTPVLIDV NGVPLRESLS YTGGGAGFGG QMAEWLPPSQ SADAALLPAL RLGNARADDL 
VRNNGIAANA VALHKDHIVG HMFLISYRPN WRWLGMRETA AKSFVDEVEA AWSEYAEGMF
GEIDVEGKRT FTEFIREGVG VHAFNGEIFV QPVWDTESTQ LFRTRFKAVS PKRVDTPGHG
IGNRFLRAGV EVDRYGRAVA YHICEDDFPR SGSGRWERIP RELPTGRPAM LHIFEPVEDG
QTRGANQFYS VMERLKMLDS LQATQLQSAI VKAMYAATIE SDLDTEKAFE YIAGAPQGQK
DNPLINILDK FSTWYDTNSV TLGGVKIPHL FPGDDLKLQT AQDSDNGFSA LEQALLRYIA
AGLGVSYEQL SRDYSKVSYS SARASANESW RYFMGRRKFI ASRLATQMFS CWLEEALLRG
IIRPPRARFD FYQARSAWSR AEWIGAGRMA IDGLKEVQES VMRIEAGLST YEKELALMGE
DYQDIFRQQV RESAEREKAG LSRPVWIAQA YQQQIAESRR PEEETTPRET
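
The relationship between the two sequence blocks can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequences are pasted in as strings with the 10-character spacing shown above, and it builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are allowed), so the standard assignments are used here. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon -> amino acid map in the conventional T, C, A, G ordering.
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First and last rows of the gene sequence above.
first_row = "ATGAAACGAA CGCCTGTCCT GATTGATGTG AACGGCGTTC CGCTTCGGGA GAGCCTCAGC"
last_row = "CCGGAAGAGG AGACAACACC ACGTGAGACG TAA"

print(translate(first_row))  # MKRTPVLIDVNGVPLRESLS
print(translate(last_row))   # PEEETTPRET
```

The two printed fragments match the start and end of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.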