Gene GBAA_4551 details


Gene Information

Locus tag: GBAA_4551
Symbol: comEC
ID: 2818990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 4138587
End bp: 4140905
Gene Length: 2319 bp
Protein Length: 772 aa
Translation table: 11
GC content: 36%
IMG OID: 637791245
Product: DNA internalization-related competence protein ComEC/Rec2
Protein accession: YP_021196
Protein GI: 47529847
COG category: [R] General function prediction only
COG ID: [COG0658] Predicted membrane metal-binding protein
[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 
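
The gene and protein lengths in this record follow from the Start bp and End bp values. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the numbers listed here, but the record does not state them. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(4138587, 4140905)
print(length_bp)                  # 2319
print(protein_length(length_bp))  # 772
```

Both results agree with the Gene Length (2319 bp) and Protein Length (772 aa) fields of this record.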


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.76471
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCAAGGAC AATGGGGCTA CGTTGCAATC TCATTTATAA TAGGGATTGC AATCGCCTTC 
TCCTCTTCAG TTGTATTGCT GACTTGTTGT CTCGGTCTAT ATGTTTTCTT TTGTTTGTAT
CGTACTTCGC GTAAAACCTT CCTATATTGT ATGATAGTGT GTTTTAGTGG CGCTATGTAC
ACCACGTATG TTCAAGGACA AAATAAGCCT CTGGGAGAGT CCTACGAAGC TACAAGAGGA
GTGATTTATA ATACACCTCT TATTAACGGG GATCGCCTAT CATTTCAAGT TGAAGATCAG
AATAAAAATA TAGTGCAGTT AAGTTACAAA ATGAAATCAG CCTCGGAAAA GAAACAAATG
CGACAATTAC ATGCAGGAGT GTCATGTATA TTTGATGGTG AGAGGAAAGA ACCACAAATA
GCTCGGAATT TTCATGGGTT TAATTATCGT GATTATTTAT ATAAGCAAAA TATTCATTTC
ATATTGGAAG CTACATATAT TTCTGAATGC CGTAAAACAT CGTTGTCACT TGTGCAATGG
ATTCTTCTTT TGAGGCAGCA AGCAATCTTA GGAGTTACAG AAATGTTTCC AGAGCAATCA
GGCGCTTTTA TGAACGCGTT ACTATTTGGT GATAGACAAC AAATGACATT TGAAGTTGAA
GGGCAATATC AACAATTCGG TCTTGTGCAT TTGTTGGCGA TTTCAGGATC GCATATCGTA
TTGTTAATGG TGATTGTGTA TTTTATTTTG CTAAGAAGTG GTGTGACAAG GGAGATAGCA
ACAGTATGTC TTATCTTCTT CATTCCTATA TATATGATTT TAGCAGGAGC GTCACCGTCT
GTTATAAGAG CTTCTATAAC AGGAGTTTTA ATGTTAATTG CTTTTATGTG TTCTATTCGT
TTATCTAGCT TAGATGCTTT AAGTATAACA GCTATATGTA TGCTTATATT TGATCCATAT
CTCGTGTTTA ATATTGGCTT TCAGTTTTCT TTTGTTGGGA GTTTTGCTTT ACTTTTATCT
GCCCCGCTCT TACTAGAGAG TGGTAATGGA GTAATTAGAA ATTCTATTTA TATTTCTCTT
ATTTCACAGC TCGTTAGTAC TCCGATTTTG TTATATCACT TCGGTTATTA TTCTCCATAT
AGTATTTTTC TAAATATCCT TTACGTTCCG TTTTTATCTC TCATTGTATT GCCGTGTAGT
ATTATTATTT TGATATGTTT GCCGATCATC CCGTTTCTTG CAAAAAGCTT TGCGAATGTA
CTATCAATAG GTTTGAATCT TTCTAATGAT TTTTTAAGTT ATTGTGAAAG TTTACCATTT
ACCCGTCTTA ATTTCGGGCA AACACCTATA CTTCTTGTAG CCTTATATTG CGTGAGTATT
ATTAGTGTAT TGATGGTTTG GGAAAGGCGA ATATCTAAGG GAATGGTGTT TATATTTGCG
GGCATATTTC TTTTTATTAG TACAGGTCAT TATGTATATC CGTATTTTCG AGAAAGTGGG
AGTGTTACAT TTCTTGATGT TGGCCAGGGG GATGCAATAT TAATTCGCCT CCCGTATGAT
CAAGAGATTT ACCTTATTGA CACTGGTGGA ACAATTCGTT TAAACAAGGA AGAATGGCAA
CGGAAAAAAC ATGAATTTTC TGTTGGAAAT GATGTTTTGA TCCCTTATTT ACAAAAGGAA
GGTATTAAAA AAATTGATAA ATTAATTGTA ACGCATGGAG ATGCAGATCA TATCGGTGCT
GCACAAGAAT TATTATCAAA TATAACCGTA AAAGAAGTTG TATTTGGTCG AAAGGAACAA
GAGGCAATAT TAGAAAAAGC AGTAAAGAAA CAGGCGTTAG AAAAGGAAGT GAAAATAAGT
GAAGTGGGGG AAGGAGAGAG TTGGCGCGTA AATGAAGCGG AATTTTTTGT GCTAGCACCA
ACAGGGAAAG AAAGAAGTGA AAATAACGCT TCAATTGTAC TGTGGGCAAA ATTAGGAGGG
ATAACGTGGC TGTTTACAGG TGATTTAGAA GAAGGAGAGA AGGGTTTAGT AGCTACATAT
CCAGATTTAC GGGCGGATGT TTTAAAGGTT GCTCATCATG GAAGTAATAC GTCATCTATA
ACGCCTTTTT TGAGCGCCGT ACAGCCTAAT ATAGCGATTA TTTCTGTCGG TGAACGGAAT
AGGTATGGGC ACCCTCATAA GGAAGGTATA GAGCGTTTTG AGAAGATGGC GATTGAAATA
TGGCGCACGG ATAAGCAAGG TGCTATTTCC TATGTTTTTA AAGAGGAACG CGGAACGTTT
CGTAGCAAAA TCACATATGA TGAAACACGG AATAGATAA
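
The gene sequence above is printed in space-separated blocks of 10 bases. A GC content figure like the one in this record could be computed along the following lines. This is a minimal sketch, not the database's own method: it assumes GC content means the percentage of G and C bases over the full sequence length, and the helper names are hypothetical.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small example.
print(gc_percent("TTGCAAGGAC AATGGGGCTA"))  # 50.0
```

Passing the complete gene sequence to `gc_percent` gives the value for the whole CDS, which can then be rounded for comparison with the GC content field.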
 
Protein sequence
MQGQWGYVAI SFIIGIAIAF SSSVVLLTCC LGLYVFFCLY RTSRKTFLYC MIVCFSGAMY 
TTYVQGQNKP LGESYEATRG VIYNTPLING DRLSFQVEDQ NKNIVQLSYK MKSASEKKQM
RQLHAGVSCI FDGERKEPQI ARNFHGFNYR DYLYKQNIHF ILEATYISEC RKTSLSLVQW
ILLLRQQAIL GVTEMFPEQS GAFMNALLFG DRQQMTFEVE GQYQQFGLVH LLAISGSHIV
LLMVIVYFIL LRSGVTREIA TVCLIFFIPI YMILAGASPS VIRASITGVL MLIAFMCSIR
LSSLDALSIT AICMLIFDPY LVFNIGFQFS FVGSFALLLS APLLLESGNG VIRNSIYISL
ISQLVSTPIL LYHFGYYSPY SIFLNILYVP FLSLIVLPCS IIILICLPII PFLAKSFANV
LSIGLNLSND FLSYCESLPF TRLNFGQTPI LLVALYCVSI ISVLMVWERR ISKGMVFIFA
GIFLFISTGH YVYPYFRESG SVTFLDVGQG DAILIRLPYD QEIYLIDTGG TIRLNKEEWQ
RKKHEFSVGN DVLIPYLQKE GIKKIDKLIV THGDADHIGA AQELLSNITV KEVVFGRKEQ
EAILEKAVKK QALEKEVKIS EVGEGESWRV NEAEFFVLAP TGKERSENNA SIVLWAKLGG
ITWLFTGDLE EGEKGLVATY PDLRADVLKV AHHGSNTSSI TPFLSAVQPN IAIISVGERN
RYGHPHKEGI ERFEKMAIEI WRTDKQGAIS YVFKEERGTF RSKITYDETR NR
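
The protein sequence corresponds to the gene sequence read under translation table 11. The sketch below shows one way to check that correspondence. It makes two assumptions: the input starts at the annotated start codon, so the first codon is reported as M even when it is an alternative start such as TTG, and translation stops at the first stop codon. The codon-to-residue assignments of table 11 are the same as those of the standard table; the names `CODON_TABLE` and `translate_cds` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text):
    """Translate a block-formatted CDS that begins at its start codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[index:index + 3]]
        if amino_acid == "*":
            break  # stop codon ends the protein
        # Assumption: the first codon is the start codon and encodes Met.
        residues.append("M" if index == 0 else amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence.
print(translate_cds("TTGCAAGGAC AATGGGGCTA CGTTGCAATC"))  # MQGQWGYVAI
```

The output matches the first block of the protein sequence, MQGQWGYVAI.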