Gene BCAH820_3675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_3675 
Symbol 
ID7191744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp3509536 
End bp3511464 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content37% 
IMG OID643557086 
Productgroup II intron reverse transcriptase/maturase 
Protein accessionYP_002452625 
Protein GI218904791 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones173 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATA GAACTAAGTC CACGCCAAAG AATAAGAAAC TAAGACACAA CGAATACTAC 
GGTATCCAAA CTTTCTTAGA TAACTTATAC CAGAAGGCAA CAAAGAAAAA TTCTTTTAAA
AATTTAATGC CTATCATTAT ATCAGATGAA AATATACTCC TCGCGTTTCG TAATATTAAA
GGAAACAAAG GGAGTAAAAC AGCAGCTTGT GACAATGTAA ACATTAAGGA TATTGAGAGA
ATGGAACAAA GCTATTTCTT GAACGAAGTA AAAAGACGCT TTCAAAACTA CCAACCGCAG
AAAGTAAGGC GCAAGGAAAT ACCGAAACCC AATGGGAAAA CCAGACCTCT GGGAATACCA
AGTATGTGGG ATAGGATTAT CCAACAATGC ATCTTACAAG TAATGGAGCC TATCTGTGAA
GCACACTTCT GTAACCGAAG TCATGGATTT CGCCCAAACA GAAGTGCTGA AAATGCTATA
GCAGACGCAA CAAAACGGGT AAATCAGCAA AACCTTACAT ATGTGGTGGA TATAGACATT
CAGGGATTCT TTGACGAAGT AAGTCACGTC AAACTCATGC GCCAACTATG GACACTGGGT
ATTCGTGACA AACAACTACT AGTCATTATC CGGAAAATAT TGAAAGCGCC AGTGCAACTG
CCTAACGGCA AAACGATATT TCCGACCAAA GGTACCCCGC AAGGTGGTAT CCTTAGTCCA
CTACTCGCCA ATGTTAACCT TAACGAATTT GACTGGTGGA TTAGTAATCA ATGGGAAACT
TTCAAAGCCA AGAAGGTAAA GCCGAGATTG AAAGATGGGA TTTGGAGTAA TGACACTGTA
ACATATTGTC TATCCAAAAC ATCCAAGATG AAACCAATGT ATATCGTAAG ATACGCAGAT
GATTTTAAAA TCTTTACAAA TACACGTAGG AATGCGGAGA AAATTTTCGA AGCAACTCAA
ATGTGGTTAG AAGAACGTCT AAAACTGCCT ATCTCAACCG AAAAGTCTAA AGTAACCAAT
CTGAAAAAGC AACAGAGTGA ATTCTTAGGT TTCACTCTTA AGGCTGTAAA GAAAGGTAAG
AAGAACGGTA ACACACGATA CATTGCAGTA ACACACATCT CTCCAAAAGC ACTGGAAAAA
ACAAAACAAG ATTTAGCMAA ACAAGTGAAA AGGATACAGA GAACCCCAAA CTCTAATGAA
ACAATTAAGA GAATCAGTAT ATACAACAGC ATGGTCATTG GTAAGCACAA CTATTATAAA
ATAGCAACGC ATGCTTCCCT GGATTTCAGT AAAATGAACC ATAGTCTTGG TCACATGATG
TATAACAGAT TCCCAAAGTC AAAAATTAGA GGAAAAAGCA ACACAAACGG ATACACGAAT
ATAGGGAAAT ATAAAGGGAA AGACAGAGGT ATTAAACCAT ACCTAAAGTC AAAAGCGATG
AGATTTCTCA TGAAATGTCC TATTTTACCA ATCTCCTATA TTCAACACAA AAAACCGATG
ATGAAAAAGC AATCTATTAA CAGATACACC GCAGAAGGAC GAGCCCTTAT ACACAAAAAC
TTGGCAGAAA TAACCGAAGC GGAACTGAAA TGGTTAAGAG AAAACCCAGT TATAAATGAA
CGGGCAACCA TAGAATACAA TGATAATCGA ATTTCTCTTT ATATTGCACA AAAAGGAAGA
TGCAGTGTAA CAGGTGAGAA ACTCTCGCCT TGGGACATCC ACTGTCACCA TAAGCGATTA
TGGAGTGAAA CAAGGGATGA CAGCTACAAA AATCTTACCA TCATCAAGCC AAGTGTCCAT
ACACTAATAC ACGCGACCAA TATAGAAACC ATAAACCAAT TCCTCAATAA ATTAAAACTC
AACGAGGAAC AGTTAGGCAA ACTTAATAAA TTGCGGAAAT TAGTCAAAAA TGAGGAAATC
TGTATGTAA
 
Protein sequence
MNDRTKSTPK NKKLRHNEYY GIQTFLDNLY QKATKKNSFK NLMPIIISDE NILLAFRNIK 
GNKGSKTAAC DNVNIKDIER MEQSYFLNEV KRRFQNYQPQ KVRRKEIPKP NGKTRPLGIP
SMWDRIIQQC ILQVMEPICE AHFCNRSHGF RPNRSAENAI ADATKRVNQQ NLTYVVDIDI
QGFFDEVSHV KLMRQLWTLG IRDKQLLVII RKILKAPVQL PNGKTIFPTK GTPQGGILSP
LLANVNLNEF DWWISNQWET FKAKKVKPRL KDGIWSNDTV TYCLSKTSKM KPMYIVRYAD
DFKIFTNTRR NAEKIFEATQ MWLEERLKLP ISTEKSKVTN LKKQQSEFLG FTLKAVKKGK
KNGNTRYIAV THISPKALEK TKQDLAKQVK RIQRTPNSNE TIKRISIYNS MVIGKHNYYK
IATHASLDFS KMNHSLGHMM YNRFPKSKIR GKSNTNGYTN IGKYKGKDRG IKPYLKSKAM
RFLMKCPILP ISYIQHKKPM MKKQSINRYT AEGRALIHKN LAEITEAELK WLRENPVINE
RATIEYNDNR ISLYIAQKGR CSVTGEKLSP WDIHCHHKRL WSETRDDSYK NLTIIKPSVH
TLIHATNIET INQFLNKLKL NEEQLGKLNK LRKLVKNEEI CM