Gene BCAH820_1419 details


Gene Information

Locus tag: BCAH820_1419
Symbol:
ID: 7188832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus AH820
Kingdom: Bacteria
Replicon accession: NC_011773
Strand:
Start bp: 1352710
End bp: 1355049
Gene Length: 2340 bp
Protein Length: 779 aa
Translation table: 11
GC content: 32%
IMG OID: 643554831
Product: putative internalin
Protein accession: YP_002450370
Protein GI: 218902536
COG category: [M] Cell wall/membrane/envelope biogenesis
              [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
        [COG5386] Cell surface protein
TIGRFAM ID:
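
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function name `cds_lengths` is illustrative and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon is a
    stop codon, which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    protein_len = codons - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(1352710, 1355049)
print(gene_len, "bp;", protein_len, "aa")
```

Under these assumptions the result agrees with the listed gene length (2340 bp) and protein length (779 aa).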


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 117
Fosmid unclonability p-value: 0.131072
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTATCAA TTATAAAGAT TATATATGAT AGGGGATATT ATTTTGAAAA AAAATATATG 
AAGGCGCTAG TAGTAGCGAC AACATTAGCA ATTCCATTTG CTGCATACTC TACTCCAGCA
TTAGCAGCAA TAAAAATTGA AGCAAACCAA TCGGTAGCAG CGAGTGATCG CACGTATGAT
ACTGAGATTA AAATATATAA AGATCAAAAA GATGAGCCAT CTATGGTTTC TCAATATATA
AAAGATCCTA AAGTGGCGAT TGTAGCTGGG AAAAAAATTG TAACTGTAAC GATGCAAGAT
AGTGATTATT TTCAATATCT TAGAATAGAA GATAGAAATC AACCAGGTGT ATTTCATGAT
GTGAAAGTTT TGTCAGAAGA TAAGAGGAAG AATGGGACGA AAGTAATTCA ATTTGAAATT
GGTGAATTTG AGAAGAAGCA TAATATGCAA ATGCACATAC TTATTCCAGC TATTGGATAC
GATCATAAAT ATCAAGTTCA ATTTGAAATT AAAGATCCAA CTGTAGGTGA CAAAGAAACA
GAGAAACCAG ATGATAACTC TAATTCAAGC AATACGGAAA CGGATAAACC AGTTGATAAT
CAAAATATGA TAACAGATAA CAAGTTAAGG GAACTTGTTA ATAAAAAAGT ATTTAATAGA
AAAGATGTAA ATACACCGAT TACGAAAGAA GAGTTATTAC AAGTAAAGAA TTTGTTTTTA
AATACGAATG AGATTCTTGA TTATAGTGCA TTAAAATATA TGCCAAATTT GAAATCTTTA
ACAGTTGCGA ATGCGAAGAT AAAAGATCCG TCGTTCTTTG CGAACTTAAA GCAATTAAAT
CATTTAGCTT TGCGTGGTAA TGAATTTTCA GATGTAACAC CACTTGTTAA GATGGATCAT
TTAGATTCTC TTGATTTAAG TAATAATAAA ATTACAAACG TTGCACCACT AATTGAAATG
AAAAATGTAA AAAGTTTATA TTTATCAGGT AACCAAATAG AAGATGTAAC AGCATTAGCG
AAAATGGAAC AACTAGATTA TTTGAATTTA GCGAATAATA AAATTACGAA TGTTGCTCCA
TTAAGCGCGT TAAAAAATGT AACATACTTG ACTTTAGCTG GTAATCAAAT TGAAGATATT
AAACCGTTAT ATTCATTACC TTTAACAGAC TTAGTATTAA CACGTAATAA AGTTAAAGAT
TTATCCGGCA TTGAGCAAAT GAAGCAATTA GAAGAATTGT GGATCGGGAA AAATGAAATA
AAAGATGTTA CTCCTCTAAG TAAGATGACA CAGTTAAAAC AATTACACCT ACCTAACAAT
GAGTTAAAGG ATATTACGCC ATTATCAAGT CTAGTAAACT TACAAAAACT TGATTTAGAA
GCAAATTATA TTTCAGACTT AACACCGGCT AGTAATTTGA AAAAGTTAGT ATTCTTAAGT
TTTGTTGCAA ATGAAATTCG TGATGTTCGA CCAGTGATAG AACTAAGTAA AACAGCCTAC
ATCAATGTTC AAAATCAAAA AGTATTTTTA GAGGAAACAG AAGTAAATAA AGAAGTAAAA
GTACCTATAT ACGAAAAAGA CGGTAAAATC TCTACAAAAA TTCGTTTGAA GGACGAAGGT
GGTACGTATA GTAACGATGC AGTTAAGTGG AGTACACCAG GTGAGAAAGT ATATGAATTT
GGTGTGAAAG ATCCATTTGC GGATACAGGA ATCTTCTTTA CGGGATCTGT CATTCAAAAT
GTGGTAGAAA GCAAAGCGGA TAACACTTCT AAAGAAGACA ATACTTCTAA AGAAGATGCA
AAAGTAGAAG TAGTGGAATT TAAAGATGTA CCAAAAGGAC ATTGGTCAGA AGAAGCAATT
CATTATTTAG CGAAAGAAAA TATTTTCAAG GGATATGGAA ATGGACAATT TGGATTTGGG
GATAGTATTA CTCGCGGACA AGTTGCGTCT TTAGTACAAA GGTACTTGAA ATTAGAAAAT
AAAGTAGAGC AGAAAGAGAG ATTTACAGAT ACGAAAGGAC ATATGTTTGA GCAAGATATT
GCTACAGTTG CGCAAGCTGG AATTATGCAA GGAGATGGTA CTGGGGAGTT TCGTCCAGAT
GGAGTATTAA CTCGATACGA AATGTCTGTA GTATTATATA AAGTATTTCA GTTAAAAGAA
GATGGAAATA ATAAAGTGAA CTTTAAAGAT GTACCAACTG GTCATTGGGC AGAAGGGTAT
GTGAAAGCGT TAGTGGATAA TAACATATCA AAAGGTGATG GAAAAGAACG CTTTTTAGGT
GATGATTTTG TAACACGTGA ACAATATGCA CTGTTTTTAT ATAACGCAAT AACAAAATAA
 
Protein sequence
MLSIIKIIYD RGYYFEKKYM KALVVATTLA IPFAAYSTPA LAAIKIEANQ SVAASDRTYD 
TEIKIYKDQK DEPSMVSQYI KDPKVAIVAG KKIVTVTMQD SDYFQYLRIE DRNQPGVFHD
VKVLSEDKRK NGTKVIQFEI GEFEKKHNMQ MHILIPAIGY DHKYQVQFEI KDPTVGDKET
EKPDDNSNSS NTETDKPVDN QNMITDNKLR ELVNKKVFNR KDVNTPITKE ELLQVKNLFL
NTNEILDYSA LKYMPNLKSL TVANAKIKDP SFFANLKQLN HLALRGNEFS DVTPLVKMDH
LDSLDLSNNK ITNVAPLIEM KNVKSLYLSG NQIEDVTALA KMEQLDYLNL ANNKITNVAP
LSALKNVTYL TLAGNQIEDI KPLYSLPLTD LVLTRNKVKD LSGIEQMKQL EELWIGKNEI
KDVTPLSKMT QLKQLHLPNN ELKDITPLSS LVNLQKLDLE ANYISDLTPA SNLKKLVFLS
FVANEIRDVR PVIELSKTAY INVQNQKVFL EETEVNKEVK VPIYEKDGKI STKIRLKDEG
GTYSNDAVKW STPGEKVYEF GVKDPFADTG IFFTGSVIQN VVESKADNTS KEDNTSKEDA
KVEVVEFKDV PKGHWSEEAI HYLAKENIFK GYGNGQFGFG DSITRGQVAS LVQRYLKLEN
KVEQKERFTD TKGHMFEQDI ATVAQAGIMQ GDGTGEFRPD GVLTRYEMSV VLYKVFQLKE
DGNNKVNFKD VPTGHWAEGY VKALVDNNIS KGDGKERFLG DDFVTREQYA LFLYNAITK
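
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a rough sketch of one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only unambiguous bases.

```python
def clean_sequence(block_text):
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, as an example input.
first_line = "ATGTTATCAA TTATAAAGAT TATATATGAT AGGGGATATT ATTTTGAAAA AAAATATATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Running the same two functions on the full gene sequence would give a value to compare with the reported GC content. The record does not say how its figure was rounded, so small differences are possible.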