Gene BAS4223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4223 
Symbol 
ID2850806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4138961 
End bp4141279 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content36% 
IMG OID637507459 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_030471 
Protein GI49187219 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein
[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.396559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAAGGAC AATGGGGCTA CGTTGCAATC TCATTTATAA TAGGGATTGC AATCGCCTTC 
TCCTCTTCAG TTGTATTGCT GACTTGTTGT CTCGGTCTAT ATGTTTTCTT TTGTTTGTAT
CGTACTTCGC GTAAAACCTT CCTATATTGT ATGATAGTGT GTTTTAGTGG CGCTATGTAC
ACCACGTATG TTCAAGGACA AAATAAGCCT CTGGGAGAGT CCTACGAAGC TACAAGAGGA
GTGATTTATA ATACACCTCT TATTAACGGG GATCGCCTAT CATTTCAAGT TGAAGATCAG
AATAAAAATA TAGTGCAGTT AAGTTACAAA ATGAAATCAG CCTCGGAAAA GAAACAAATG
CGACAATTAC ATGCAGGAGT GTCATGTATA TTTGATGGTG AGAGGAAAGA ACCACAAATA
GCTCGGAATT TTCATGGGTT TAATTATCGT GATTATTTAT ATAAGCAAAA TATTCATTTC
ATATTGGAAG CTACATATAT TTCTGAATGC CGTAAAACAT CGTTGTCACT TGTGCAATGG
ATTCTTCTTT TGAGGCAGCA AGCAATCTTA GGAGTTACAG AAATGTTTCC AGAGCAATCA
GGCGCTTTTA TGAACGCGTT ACTATTTGGT GATAGACAAC AAATGACATT TGAAGTTGAA
GGGCAATATC AACAATTCGG TCTTGTGCAT TTGTTGGCGA TTTCAGGATC GCATATCGTA
TTGTTAATGG TGATTGTGTA TTTTATTTTG CTAAGAAGTG GTGTGACAAG GGAGATAGCA
ACAGTATGTC TTATCTTCTT CATTCCTATA TATATGATTT TAGCAGGAGC GTCACCGTCT
GTTATAAGAG CTTCTATAAC AGGAGTTTTA ATGTTAATTG CTTTTATGTG TTCTATTCGT
TTATCTAGCT TAGATGCTTT AAGTATAACA GCTATATGTA TGCTTATATT TGATCCATAT
CTCGTGTTTA ATATTGGCTT TCAGTTTTCT TTTGTTGGGA GTTTTGCTTT ACTTTTATCT
GCCCCGCTCT TACTAGAGAG TGGTAATGGA GTAATTAGAA ATTCTATTTA TATTTCTCTT
ATTTCACAGC TCGTTAGTAC TCCGATTTTG TTATATCACT TCGGTTATTA TTCTCCATAT
AGTATTTTTC TAAATATCCT TTACGTTCCG TTTTTATCTC TCATTGTATT GCCGTGTAGT
ATTATTATTT TGATATGTTT GCCGATCATC CCGTTTCTTG CAAAAAGCTT TGCGAATGTA
CTATCAATAG GTTTGAATCT TTCTAATGAT TTTTTAAGTT ATTGTGAAAG TTTACCATTT
ACCCGTCTTA ATTTCGGGCA AACACCTATA CTTCTTGTAG CCTTATATTG CGTGAGTATT
ATTAGTGTAT TGATGGTTTG GGAAAGGCGA ATATCTAAGG GAATGGTGTT TATATTTGCG
GGCATATTTC TTTTTATTAG TACAGGTCAT TATGTATATC CGTATTTTCG AGAAAGTGGG
AGTGTTACAT TTCTTGATGT TGGCCAGGGG GATGCAATAT TAATTCGCCT CCCGTATGAT
CAAGAGATTT ACCTTATTGA CACTGGTGGA ACAATTCGTT TAAACAAGGA AGAATGGCAA
CGGAAAAAAC ATGAATTTTC TGTTGGAAAT GATGTTTTGA TCCCTTATTT ACAAAAGGAA
GGTATTAAAA AAATTGATAA ATTAATTGTA ACGCATGGAG ATGCAGATCA TATCGGTGCT
GCACAAGAAT TATTATCAAA TATAACCGTA AAAGAAGTTG TATTTGGTCG AAAGGAACAA
GAGGCAATAT TAGAAAAAGC AGTAAAGAAA CAGGCGTTAG AAAAGGAAGT GAAAATAAGT
GAAGTGGGGG AAGGAGAGAG TTGGCGCGTA AATGAAGCGG AATTTTTTGT GCTAGCACCA
ACAGGGAAAG AAAGAAGTGA AAATAACGCT TCAATTGTAC TGTGGGCAAA ATTAGGAGGG
ATAACGTGGC TGTTTACAGG TGATTTAGAA GAAGGAGAGA AGGGTTTAGT AGCTACATAT
CCAGATTTAC GGGCGGATGT TTTAAAGGTT GCTCATCATG GAAGTAATAC GTCATCTATA
ACGCCTTTTT TGAGCGCCGT ACAGCCTAAT ATAGCGATTA TTTCTGTCGG TGAACGGAAT
AGGTATGGGC ACCCTCATAA GGAAGGTATA GAGCGTTTTG AGAAGATGGC GATTGAAATA
TGGCGCACGG ATAAGCAAGG TGCTATTTCC TATGTTTTTA AAGAGGAACG CGGAACGTTT
CGTAGCAAAA TCACATATGA TGAAACACGG AATAGATAA
 
Protein sequence
MQGQWGYVAI SFIIGIAIAF SSSVVLLTCC LGLYVFFCLY RTSRKTFLYC MIVCFSGAMY 
TTYVQGQNKP LGESYEATRG VIYNTPLING DRLSFQVEDQ NKNIVQLSYK MKSASEKKQM
RQLHAGVSCI FDGERKEPQI ARNFHGFNYR DYLYKQNIHF ILEATYISEC RKTSLSLVQW
ILLLRQQAIL GVTEMFPEQS GAFMNALLFG DRQQMTFEVE GQYQQFGLVH LLAISGSHIV
LLMVIVYFIL LRSGVTREIA TVCLIFFIPI YMILAGASPS VIRASITGVL MLIAFMCSIR
LSSLDALSIT AICMLIFDPY LVFNIGFQFS FVGSFALLLS APLLLESGNG VIRNSIYISL
ISQLVSTPIL LYHFGYYSPY SIFLNILYVP FLSLIVLPCS IIILICLPII PFLAKSFANV
LSIGLNLSND FLSYCESLPF TRLNFGQTPI LLVALYCVSI ISVLMVWERR ISKGMVFIFA
GIFLFISTGH YVYPYFRESG SVTFLDVGQG DAILIRLPYD QEIYLIDTGG TIRLNKEEWQ
RKKHEFSVGN DVLIPYLQKE GIKKIDKLIV THGDADHIGA AQELLSNITV KEVVFGRKEQ
EAILEKAVKK QALEKEVKIS EVGEGESWRV NEAEFFVLAP TGKERSENNA SIVLWAKLGG
ITWLFTGDLE EGEKGLVATY PDLRADVLKV AHHGSNTSSI TPFLSAVQPN IAIISVGERN
RYGHPHKEGI ERFEKMAIEI WRTDKQGAIS YVFKEERGTF RSKITYDETR NR