Gene BAS5042 details


Gene Information

Locus tag: BAS5042
Symbol:
ID: 2853085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4914369
End bp: 4915724
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 40%
IMG OID: 637508297
Product: comF operon protein 1
Protein accession: YP_031281
Protein GI: 49188028
COG category: [L] Replication, recombination and repair
COG ID: [COG4098] Superfamily II DNA/RNA helicase required for DNA uptake (late competence protein)
TIGRFAM ID:
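
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical. A GC helper is included for use with the gene sequence, without asserting the record's 40% figure.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length < 3 or gene_length % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


length = gene_length_bp(4914369, 4915724)
print(length)                     # 1356
print(protein_length_aa(length))  # 451
```

Both values agree with the Gene Length and Protein Length fields of this record.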


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTGATGC TCGCTGGAAA GCAGTTACTA TTAGAAGAAC TCCCTTCAGA TTTACGGAGA 
GAATTAAGTG ATTTGAAAAA GGAGGGAGAG GTCATATGTG TACAAGGCGT AATAAAGAAG
GCTTCTAAAT ATATATGTCA GCGCTGCGGG AATATAGAGC AGCGGCTATT TGCATCATTT
TTATGTAAAA GGTGCAGTAA AGTATGTACG TATTGCCGGA AGTGTATAAC GATGGGGAGA
GTTAGTGAAT GTGCTGTACT TGTTCGCGGG ATTCATGAAA GAAAAGGAGA AAGGGAATTA
CATTCGTTAC AGTGGAAAGG GAGTTTGTCT CTTGGTCAGG AGCTGGCGGC GCAAGGTGTT
ATAGAAGCTA TTAAGCAAAA AGAATCTTTT TTTATTTGGG CTGTGTGCGG GGCTGGAAAA
ACAGAAATGT TATTTTACGG TATAGAAGAG GCACTTCAAA AAGGAGAAAG AGTGTGTATC
GCAACGCCAA GGACGGACGT TGTACTGGAA TTAGTACCGA GATTACAAGA AGTGTTTCCA
AGTATAAATG TAGCTGCTTT ATACGGAGGG AGTGTAGATC GTGAAAAAGA TGCAGCGTTA
GTCGTTGCAA CGACGCATCA ATTGTTACGT TATTATAGAG CGTTTCATGT AATGATTGTA
GATGAGATTG ATGCCTTCCC GTATCATGTG GATCAAATGT TACAGTATGC AGTGCAGCAA
GCGATGAAAG AGAAAGCAGC GCGTATTTAT TTAACGGCAA CCCCTGATGA AAAGTGGAAG
CGCAATTTCA GAACGGGGAA ACAAAAAGGT ATCATTGTCT CTGGACGATA CCATCGTCAT
CCGTTACCAG TTCCTCTATT TAGTTGGTGC GGAAATTGGA AGAAAAGCCT CCATCATAAA
AAAATTCCTC GTGTGTTACT ACAATGGTTA AAAATGTACG TAAACAAAAA ATACCCTATT
TTTTTATTTG TTCCTCATGT GCGATATATA GAAGAAATAG GCCTGTTATT GAAAGGGTTG
GATCATAGAA TCGATGGCGT ACATGCAGAA GATCCGATGA GAAAAGAAAA AGTGGAAGCG
TTTAGAAAGG GAGACATTCC GTTATTAGTT ACAACGACAA TTTTAGAAAG AGGGGTAATT
GTGAAGAACT TACAAGTGGC TGTGTTAGGG GCTGAAGAAG AAATATTTTC AGAAAGTGCG
CTCGTACAAA TTGCAGGCCG AGCAGGTCGT AGTTTTGAAG AACCGTATGG TGAGGTTGTT
TATTTTCATT ACGGTAAGAC AGAGTCGATG GTACGTGCGA AAAGACATAT TCAAAGTATG
AACAAAAGTG CGAAAGAACA AGGATTAATT GATTAA
 
Protein sequence
MLMLAGKQLL LEELPSDLRR ELSDLKKEGE VICVQGVIKK ASKYICQRCG NIEQRLFASF 
LCKRCSKVCT YCRKCITMGR VSECAVLVRG IHERKGEREL HSLQWKGSLS LGQELAAQGV
IEAIKQKESF FIWAVCGAGK TEMLFYGIEE ALQKGERVCI ATPRTDVVLE LVPRLQEVFP
SINVAALYGG SVDREKDAAL VVATTHQLLR YYRAFHVMIV DEIDAFPYHV DQMLQYAVQQ
AMKEKAARIY LTATPDEKWK RNFRTGKQKG IIVSGRYHRH PLPVPLFSWC GNWKKSLHHK
KIPRVLLQWL KMYVNKKYPI FLFVPHVRYI EEIGLLLKGL DHRIDGVHAE DPMRKEKVEA
FRKGDIPLLV TTTILERGVI VKNLQVAVLG AEEEIFSESA LVQIAGRAGR SFEEPYGEVV
YFHYGKTESM VRAKRHIQSM NKSAKEQGLI D
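
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, which shares the standard codon assignments but allows alternative initiation codons that are read as methionine at the first position. The following is a minimal sketch of that relationship, not the tool that produced this record. It assumes the gene sequence is shown 5' to 3' on the coding strand (the Strand field is empty in this record), and `translate_cds` and the constants are hypothetical names.

```python
from itertools import product

# Standard codon assignments (also used by table 11), in NCBI TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons:
        return ""
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]  # any valid start codon is read as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCTGATGC TCGCTGGAAA GCAGTTACTA"))  # MLMLAGKQLL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, with translation stopping at the terminal TAA.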