Gene pE33L466_0077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagpE33L466_0077 
Symbol 
ID3399597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_007103 
Strand
Start bp80256 
End bp82349 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content35% 
IMG OID637659916 
ProductS-layer protein 
Protein accessionYP_245580 
Protein GI67077960 
COG category[R] General function prediction only 
COG ID[COG5263] FOG: Glucan-binding domain (YG repeat) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0803176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAA TTAAGAAATT ATTTTTAAGT TTTATGGTAT GTATTTTGTT ATTCAGTACA 
GTAGGAACAG CGTATGCAGA ACAAAATGGT CAAGTGAATT CATTATCTTT TATTGACGTT
CCTAAAACTC ACTGGGCGTA TAAAGAAATG ATGTATATGG CGGAAAATAA GATTATAACG
GGATACGGAA ATGGGTATTT CGGTGCAGCC GATCTTATTA CCCGCGAACA TCTCGCTGCT
TTCTTGTATC GATATTTAAA ACCGCAAGAT AGTACAAATA ATCCATTTGT TGATATTGGT
GACAGCAATT TCAAAAAGGA AATTTTGGCA CTAACAGCAC GTGGCATTTT CAGTGTAAAT
GCTGAAAAAA AATTCAATCC AAAAAATAAT ATGACACGTG CCGAAATGGC AACTGTGCTT
GTCAAAGCAT TTGATTTAAA GCCGCAAGGA AATGTTGAAT TTACGGATAT GAAAGGTCAC
TGGGCCAACG AATATGTCAA AATATTAGCC GGTAACAACA TAACAAGTGG TACAGGTGAC
GGCAACTTTA ATCCAAATGG CATTGTTACT AGAGAGCAAT TTTCTATGTT CTTATATAGG
ACGATTATGA AAGTAACAGA TATGCAAAAT GATGAATCAG TTGGCTGGGT GAAGGAAAAA
GAGAATTGGT ACTATTACAA CAAAGATGGG AGTAAGCAAA GGGATAACAT TACGTTAGAT
GGAATAGAGT ACTCCTTTTT TAAAGATGGA AGGCTATTCC AAGGAAGAAA GCAGGTAGGG
AAAGATACTT TATATTATAG TGAGCCTGGT AAATTAAAGA CTGGATGGAG TTTTTCTGCA
ACTTCATGGA GCTATTTAAA AGATGGGAAG TATGTGACAG GAACATTTAC TTATCAAGGA
AAACCATTTG AAATTAATAA GTACGGGGAT ATGGAAAAAG GATGGATTAC TCTTCGTTCT
GCTGTTAAAA GAATCTATCC AAAGCCAGAA ACAAAATTTC TATTGAAATC TAAGCCTGTA
AAAGATGGAG ATGTTTTAGA AGTTATATCC AAGCAAGGTT TATGGTATCA GGTCAAATAT
CAGGGAGAGG TTGGATATGT TCGAATACTG GAGTCTGTTG TAATTGGTGA ATCACCAGTT
CGTTCATGGG ATGTAGCAAA GGAAGCAACG AATTTGTCTC ATTTTATGAT AACTGAGTAT
CATAAAGATC CAGAAAAATA CTTCCCAAAA AATATTGAAA AGAAATTTGA TAAACAATTA
GATAGTGATT TAACTCTTCT TGCGAATGGA TTACAATGGA TTGATCAATT AAAAGAAGCA
CTCTATTTAG ATAATAAACA AGGATGGGTA CAAGAAGAAG GAAAATGGAC GTATTATAAA
AAAGATGGGC AAAGAATAAC AGGTTTTCAA TCAATTGATG GAAAACGTTA TTATTTGGGA
ACCGATGGTT TTATGCAAAC TGGATGGGTA CATACAAATG GATACGATTA TTATTTGGGA
ACCGATGGCG TGGTACAAAC TGGAATCCAA CATATTGATG GAAAAATCTA TTACTTTGGT
CCGTTAGGTC CTGTACAATC AGGATTTACA CATGTTGATG GGAAACCGTA TTATTTTGAT
GCTTCTCATG AATCAAGAAA TGGTTGGATG AAACAAGAGT TTAATTGGTA TTTACTACAG
CCTTCAGGTG CATTACAAAC AGGAGATTTT ACATATAAAG ACAAGAAATT CTCCTTTAAT
CAAGACGGTG AAATGATTAA AGGGTGGGTT ACATTAGAAT CCCTTGTTAA AAAAGTATAT
CCTGAGCCAG ACTTGAAAAA AGCATTACGT TCAAAAAGTG TGAATAAGGG TGAAGTGATA
GAAGTAGTGG GAAAAGTCGG TTCTTGGTAT GAGGTTAATT ACCAAGGTGA AAAAGGATAT
GTGCGTATTC ATGATGCCAT TATTTTTGAT CAGGAGGCTA AAAGTCCACT TACTTTGCTT
GATGGAAAAG TAAAGATATT TGAAGGTGTA TTAGATTATT TAAAAAGTGA CGAACCAGTC
ATAAATAACA TGATTAAAAT TTTAGAAGAA GAAGAACAGC GTTGGTTGGG GTAG
 
Protein sequence
MNTIKKLFLS FMVCILLFST VGTAYAEQNG QVNSLSFIDV PKTHWAYKEM MYMAENKIIT 
GYGNGYFGAA DLITREHLAA FLYRYLKPQD STNNPFVDIG DSNFKKEILA LTARGIFSVN
AEKKFNPKNN MTRAEMATVL VKAFDLKPQG NVEFTDMKGH WANEYVKILA GNNITSGTGD
GNFNPNGIVT REQFSMFLYR TIMKVTDMQN DESVGWVKEK ENWYYYNKDG SKQRDNITLD
GIEYSFFKDG RLFQGRKQVG KDTLYYSEPG KLKTGWSFSA TSWSYLKDGK YVTGTFTYQG
KPFEINKYGD MEKGWITLRS AVKRIYPKPE TKFLLKSKPV KDGDVLEVIS KQGLWYQVKY
QGEVGYVRIL ESVVIGESPV RSWDVAKEAT NLSHFMITEY HKDPEKYFPK NIEKKFDKQL
DSDLTLLANG LQWIDQLKEA LYLDNKQGWV QEEGKWTYYK KDGQRITGFQ SIDGKRYYLG
TDGFMQTGWV HTNGYDYYLG TDGVVQTGIQ HIDGKIYYFG PLGPVQSGFT HVDGKPYYFD
ASHESRNGWM KQEFNWYLLQ PSGALQTGDF TYKDKKFSFN QDGEMIKGWV TLESLVKKVY
PEPDLKKALR SKSVNKGEVI EVVGKVGSWY EVNYQGEKGY VRIHDAIIFD QEAKSPLTLL
DGKVKIFEGV LDYLKSDEPV INNMIKILEE EEQRWLG