Gene BCAH820_4372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_4372 
Symbol 
ID7188939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp4148212 
End bp4149828 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content36% 
IMG OID643557783 
Productputative minor structural protein 
Protein accessionYP_002453321 
Protein GI218905487 
COG category[S] Function unknown 
COG ID[COG4926] Phage-related protein 
TIGRFAM ID[TIGR01665] phage minor structural protein, N-terminal region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones200 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGTAG TAAAAGGGAT TAATAATCAA GAAGAAATGC TGACAGATTA TAAAGAGGTG 
AAGAGAAAGA GACGTGTGAA CGGAGAACAC TCTCTTTCTT TTTATTTATT GAATACTCCT
AATGTAAAAC ATGCTTATCA TCTTGTAGAT AAGCGAGCGA GTATTCTCGA TCAATCTGAT
GAGTATATCG CATTAGGAAT TAATAAGCGT GGACATTATG GGAAACTCAT TACAGCGCCG
CATATTTTTT TCGATGATAT GATGGAGCAT CAATACAATT TATATAACGG ATACGCCAAT
TTTAAACAGT GCATGGAGTT CATTTTTAAT GGGACAGGCT GGAAATATGT TAATCAAGGA
GCGTTTTCAG CAACGAAATT TGAAAACTTT GGTGATGACA CACGGTCAGC GCTTTTACAA
AAGGCTTTAA ATCGTTATGA AGCTGAGATG GAAATTAATT ATTCATCTAA AACAGTTACA
TTTAAAAATC AAATCGGAAA AGAAACTGAT GCACAGTTCC GTTATGGCCA TAACCTTAAA
ACGTTTGAAG AAGATACGGA TATGACTAAT TTTGCTACAT ATATTAGGGG TTACGGTAAA
GATGCAGCTG GCAATGAATT TATGGTGGAA TATGAATCAC CAATGGCTAA AGTATATGGG
CGTATTCATC AAAAACCAAT TCGTGATGAG CGGTATAAAA CAAAAGAGTC ATTATTAGAA
GCTTGTAAAA AAGCAATAAA TGACGTGCCA GATACCAGGT TTAAAGTTAG TATTGTTAGT
TTAATAGAAA ATGGGTTAAG TCCACTTCAT AAATTCGATT TAGGTGACTA TGTATACATG
TTGTATGAAG AAGCTGATGT AAAGGTGAAA ATTCGTGTGA TTCAGATTGA AGACGATCCG
ACAGATCCAA CTAAAACACC GATAGTTGAA TTATCAACCT TTAAAGAATT AAAGACAGCA
AGTGCAGTTC AGGCACAGTT TCAACAAACG CAGAAGCAAG TGCAAAAATT ACTAGATGAT
GGTGGTAATT TAAATCTAGC GTTAAAACGT CTGTATATGA ACACACATGT TTTTCAAGAC
GATACAGGTA TGTGGATGGT AGATCCAGAG AACCCAAATC GATACGTTCA TCATGGCGCT
GGTGGTAGTG ATTATCATGG CGGGATGATA CGTATTGAGC GCCCAGATGG ATATGCAACA
ATCATAGATG GTTATTTACA ATATGGATTT GATATTGCAG GGCATTATCC ACCATATCGC
GGGATAAACG TAGTAGAAGA TGGTTGGTGG TTAACGTCTA CCCATGATGT CTTAGATAGT
TGCCAATTTT ACACATTTGA ACATAAAACA AGATATGTAA AATTAAAGGC ACAGATTTTC
ACTGAAGAAG GTGGGGAAGT TGAAATCGCA ATGGTTTCTT CCGATAGTGG TCAGCAAATC
ATGAGCAAAG CTTCATCCAA ACAGACTACG GCACCACCAC AAAATGATGA TGTGGATCTT
TCGTATGATT TAGGTGTTCC AACTGGGGAA CTTAAAAGTT TTTATTTACG AATGAGAAAT
AAGGTAAAAG GAAAGAAAGC ATATGCACGG ATATTCCGTG TGTGGCTAGA AAAATAA
 
Protein sequence
MLVVKGINNQ EEMLTDYKEV KRKRRVNGEH SLSFYLLNTP NVKHAYHLVD KRASILDQSD 
EYIALGINKR GHYGKLITAP HIFFDDMMEH QYNLYNGYAN FKQCMEFIFN GTGWKYVNQG
AFSATKFENF GDDTRSALLQ KALNRYEAEM EINYSSKTVT FKNQIGKETD AQFRYGHNLK
TFEEDTDMTN FATYIRGYGK DAAGNEFMVE YESPMAKVYG RIHQKPIRDE RYKTKESLLE
ACKKAINDVP DTRFKVSIVS LIENGLSPLH KFDLGDYVYM LYEEADVKVK IRVIQIEDDP
TDPTKTPIVE LSTFKELKTA SAVQAQFQQT QKQVQKLLDD GGNLNLALKR LYMNTHVFQD
DTGMWMVDPE NPNRYVHHGA GGSDYHGGMI RIERPDGYAT IIDGYLQYGF DIAGHYPPYR
GINVVEDGWW LTSTHDVLDS CQFYTFEHKT RYVKLKAQIF TEEGGEVEIA MVSSDSGQQI
MSKASSKQTT APPQNDDVDL SYDLGVPTGE LKSFYLRMRN KVKGKKAYAR IFRVWLEK