Gene Information |
Locus tag | GBAA_5426 |
Symbol | comFA |
ID | 2818579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | - |
Start bp | 4913158 |
End bp | 4914507 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637792094 |
Product | ComF operon protein 1 |
Protein accession | YP_022088 |
Protein GI | 47530739 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4098] Superfamily II DNA/RNA helicase required for DNA uptake (late competence protein) |
TIGRFAM ID | |
|
|
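The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which agrees with the stated 1350 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 4913158
END_BP = 4914507


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1350
print(protein_length(length))  # 449
```

Both values agree with the Gene Length and Protein Length fields of the record.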
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.101585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGCTG GAAAGCAGTT ACTATTAGAA GAACTCCCTT CAGATTTACG GAGAGAATTA AGTGATTTGA AAAAGGAGGG AGAGGTCATA TGTGTACAAG GCGTAATAAA GAAGGCTTCT AAATATATAT GTCAGCGCTG CGGGAATATA GAGCAGCGGC TATTTGCATC ATTTTTATGT AAAAGGTGCA GTAAAGTATG TACGTATTGC CGGAAGTGTA TAACGATGGG GAGAGTTAGT GAATGTGCTG TACTTGTTCG CGGGATTCAT GAAAGAAAAG GAGAAAGGGA ATTACATTCG TTACAGTGGA AAGGGAGTTT GTCTCTTGGT CAGGAGCTGG CGGCGCAAGG TGTTATAGAA GCTATTAAGC AAAAAGAATC TTTTTTTATT TGGGCTGTGT GCGGGGCTGG AAAAACAGAA ATGTTATTTT ACGGTATAGA AGAGGCACTT CAAAAAGGAG AAAGAGTGTG TATCGCAACG CCAAGGACGG ACGTTGTACT GGAATTAGTA CCGAGATTAC AAGAAGTGTT TCCAAGTATA AATGTAGCTG CTTTATACGG AGGGAGTGTA GATCGTGAAA AAGATGCAGC GTTAGTCGTT GCAACGACGC ATCAATTGTT ACGTTATTAT AGAGCGTTTC ATGTAATGAT TGTAGATGAG ATTGATGCCT TCCCGTATCA TGTGGATCAA ATGTTACAGT ATGCAGTGCA GCAAGCGATG AAAGAGAAAG CAGCGCGTAT TTATTTAACG GCAACCCCTG ATGAAAAGTG GAAGCGCAAT TTCAGAACGG GGAAACAAAA AGGTATCATT GTCTCTGGAC GATACCATCG TCATCCGTTA CCAGTTCCTC TATTTAGTTG GTGCGGAAAT TGGAAGAAAA GCCTCCATCA TAAAAAAATT CCTCGTGTGT TACTACAATG GTTAAAAATG TACGTAAACA AAAAATACCC TATTTTTTTA TTTGTTCCTC ATGTGCGATA TATAGAAGAA ATAGGCCTGT TATTGAAAGG GTTGGATCAT AGAATCGATG GCGTACATGC AGAAGATCCG ATGAGAAAAG AAAAAGTGGA AGCGTTTAGA AAGGGAGACA TTCCGTTATT AGTTACAACG ACAATTTTAG AAAGAGGGGT AATTGTGAAG AACTTACAAG TGGCTGTGTT AGGGGCTGAA GAAGAAATAT TTTCAGAAAG TGCGCTCGTA CAAATTGCAG GCCGAGCAGG TCGTAGTTTT GAAGAACCGT ATGGTGAGGT TGTTTATTTT CATTACGGTA AGACAGAGTC GATGGTACGT GCGAAAAGAC ATATTCAAAG TATGAACAAA AGTGCGAAAG AACAAGGATT AATTGATTAA
|
Protein sequence | MLAGKQLLLE ELPSDLRREL SDLKKEGEVI CVQGVIKKAS KYICQRCGNI EQRLFASFLC KRCSKVCTYC RKCITMGRVS ECAVLVRGIH ERKGERELHS LQWKGSLSLG QELAAQGVIE AIKQKESFFI WAVCGAGKTE MLFYGIEEAL QKGERVCIAT PRTDVVLELV PRLQEVFPSI NVAALYGGSV DREKDAALVV ATTHQLLRYY RAFHVMIVDE IDAFPYHVDQ MLQYAVQQAM KEKAARIYLT ATPDEKWKRN FRTGKQKGII VSGRYHRHPL PVPLFSWCGN WKKSLHHKKI PRVLLQWLKM YVNKKYPIFL FVPHVRYIEE IGLLLKGLDH RIDGVHAEDP MRKEKVEAFR KGDIPLLVTT TILERGVIVK NLQVAVLGAE EEIFSESALV QIAGRAGRSF EEPYGEVVYF HYGKTESMVR AKRHIQSMNK SAKEQGLID
|
| |
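As a rough illustration of how the sequence fields relate to each other, the sketch below strips the 10-base spacing, computes GC content, and translates codons into amino acids. It makes several assumptions. The gene sequence is treated as already being in coding orientation (it starts with ATG and ends with TAA), although the gene lies on the minus strand. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares. Alternative start codons permitted by table 11 are not handled. All function names are hypothetical, and only the first 30 bases of the record are used as input.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (T, C, A, G).
# Translation table 11 uses the same assignments; it differs in which
# codons may act as start codons, which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCTCGCTG GAAAGCAGTT ACTATTAGAA"
print(translate(prefix))            # MLAGKQLLLE
print(f"{gc_content(prefix):.0%}")  # 40%
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions would be the natural way to compare against the complete protein and the reported GC content.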