Gene pE33L466_0234 details


Gene Information

Locus tag: pE33L466_0234
Symbol:
ID: 3399615
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_007103
Strand:
Start bp: 241017
End bp: 242228
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 41%
IMG OID: 637660065
Product: hypothetical protein
Protein accession: YP_245729
Protein GI: 67078109
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0791] Cell wall-associated hydrolases (invasion-associated proteins)
TIGRFAM ID:
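
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(241017, 242228)
print(length, protein_length_aa(length))  # 1212 403
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.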


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGTGCGG AACCAAATCG CGACTTTGAA AATGAACATC ATCCTGAAGC TGAACTAGGG 
AATGCTAAGC CTGCCTTGAA AAAGGCAGGC CATTCTTTAG GAAAGGCGGC TGGAAAAGGA
TCCAAAATAG CGGGTAAAGC AGTTGCACAA GTTGGTAAGA AAGTAGCAGT TAAGGTAGCT
CAAAAGGCAG CTACAGTTGC TGTTGCAAAG CCATTATTAA TAATAGCCGG TGTTATTCTG
GCTGTTGTTG CAGGTTTAGG AATAGTCATA TTTTTATTAA CTGCTACAAT GGGAGAGGAT
GAAGTCAATC CCGGGGGAAT TGGTGGTCCT TTTACACCAG GTACTGCAAG TGTTAGCCCT
GAAGTAATGA GATGGGAGCC ATTGGTGAGA AAGTATGCAG CACAACATGG AGTTGAAGCA
ATGACTCCAT TAATACTAGC TCTTATTCAA CAAGAAAGTT CTGGGACTCA ATTGGATGTA
ATGCAAAGTT CGGAATCACA AGGATATGGT CCAGGATATT TTACAGATCC AGAAGAGAGT
ATTAAATATG GTTTAATGCA TTTCGCTGAT TGTCATAAAA AATCCAACGG CGATCCAAAT
ATTACACTAC AGTGTTATAA CTATGGTACT GGTTATGCAA ACTATGCACT TTCTAACGGT
GGATATACAC ACGCGAATGC AAGAGCATTT TCAGCAGAAC AGACAGCAAA AACTGGTTAT
AAATGTGCTT CTTGGCGAAG TGCAGAAGCA GTAGCCAATA ATTGGTGTTA TGGTGATCCA
GACTATGTTC CACATGTATT AAGATATTAT CAAGGTGGTA GTGGTGGTGG ACCTGTAGCC
GGCGGAGATG AATTATTTAA AAAGGTAATG GATGAAGCAA TTAAATATCA GGGGTGGCCC
TATGTTTGGG CAGGAAGAAC TCCACAAACA TCATTTGATT GTAGTGGACT CATTCAGTGG
GTATATGGGC AGGCCGGTGT TAATCTAAAT GGTACTGCAG AAACACAATT TAAAATTACA
CAGCGAACCA ATGATCCACA ACCAGGAGAT TTAATATTTT TCCAAGGTAC CTATAAACCA
GGTATTTCAC ACGTTGGAAT TTATGTAGGT AATAATCGTA TGTATCATGC GGGAGATCCA
ATTGGTTACG CAGATTTAAG TAATCCTTAT TGGCAACAGC ATTTTGCGGG TTATGGTAAG
GTGGCAAGAT AG
 
Protein sequence
MSAEPNRDFE NEHHPEAELG NAKPALKKAG HSLGKAAGKG SKIAGKAVAQ VGKKVAVKVA 
QKAATVAVAK PLLIIAGVIL AVVAGLGIVI FLLTATMGED EVNPGGIGGP FTPGTASVSP
EVMRWEPLVR KYAAQHGVEA MTPLILALIQ QESSGTQLDV MQSSESQGYG PGYFTDPEES
IKYGLMHFAD CHKKSNGDPN ITLQCYNYGT GYANYALSNG GYTHANARAF SAEQTAKTGY
KCASWRSAEA VANNWCYGDP DYVPHVLRYY QGGSGGGPVA GGDELFKKVM DEAIKYQGWP
YVWAGRTPQT SFDCSGLIQW VYGQAGVNLN GTAETQFKIT QRTNDPQPGD LIFFQGTYKP
GISHVGIYVG NNRMYHAGDP IGYADLSNPY WQQHFAGYGK VAR
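
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch of translation under NCBI translation table 11, the table named in the record. Table 11 uses the standard amino acid assignments but accepts several alternative start codons, including the TTG that opens this gene; an initiating codon is read as methionine. The helper names `translate` and `gc_percent` are hypothetical, only short excerpts of the sequence are used, and this is not necessarily how the database derived its values.

```python
# Standard codon assignments (shared by NCBI table 11), in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10)."""
    return "".join(seq.split()).upper()


def translate(seq: str, cds_start: bool = True) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and cds_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("TTGAGTGCGG AACCAAATCG CGACTTTGAA"))  # MSAEPNRDFE
print(translate("GTGGCAAGAT AG", cds_start=False))    # VAR
```

The two excerpts reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would allow a comparison with the GC content field.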