Gene BAS4690 details


Gene Information

Locus tag: BAS4690
Symbol:
ID: 2852647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4577434
End bp: 4578783
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 38%
IMG OID: 637507924
Product: cytochrome d ubiquinol oxidase subunit I
Protein accession: YP_030934
Protein GI: 49187681
COG category: [C] Energy production and conversion
COG ID: [COG1271] Cytochrome bd-type quinol oxidase, subunit 1
TIGRFAM ID:
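
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4577434, 4578783)
print(length)                  # 1350
print(protein_length(length))  # 449
```

Both values agree with the record: 1350 bp and 449 aa.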


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGACG TTCTGTTACT GAGTCGTTTT CAATTTGCAA TTACTATTTT TTATCACTTT 
TTATTTGTAC CTTTGACAAT CGGACTTGTC ATTTTAGTAG CATGTATGGA AACTCAATAC
GCCCGCACAT TGAATCCAAC ATACCGCAAA ATGGCAAATT TCTGGGGTAA ATTATTTACA
ATTAACTTCG TAATGGGGAT TATAACCGGG ATTACGATGG AATTCCAATT TGGAACAAAC
TGGTCTGAGT ACTCCAAATA TATGGGAGAT ATTTTCGGAT CCCCTCTCGC AATCGAAGCA
CTCGTTGCCT TCTTCTTAGA ATCTACTTTC ATGGGAATAT GGTTATTCGG TAAAGACAAA
ATTTCACCAA AGTTCCGTGC CTTCTGTATG TGGATGGTTG CACTTGGAAC AAATATTTCC
GCCCTTTGGA TTATTACAGC AAACGGCTTT ATGCAAAACC CTGTTGGCTA CGTCGTACGT
AACGGCCGCG CTGAATTAAA TGATTTCTGG GCACTCGTTA CGAATCCATA CGCTTGGAAC
ATGTTCTTCC ATACTGTAAT TGGTTGTTAT ATTGTTGGTG CTTTCTTCGT TATGGCAATT
AGTGCCTATC ACTTATTACG AAAAAATGAA GTTGAATTCT TCAAAAAGTC ATTTAAGTTT
GGTTTAATGT TAGGCTTATT CGCCGCAACA ATTACACCGT TTATAGGACA TCAATCTGGT
GTATCAGCAG CTAAATATCA ACCAGCTAAA GGTGCTGCGA TGGAAGCTGT TTGGGAAACT
GGAAAAGGAC AAGGCTTCTC GATTGTTCAA ATTCCTGATG TAAAAAACGA AAAGAACTTT
GAATTCCTTA CGATTCCAAA GTTAGGAAGT TTCTTCTATA CAAATTCATT TGATGGCGAA
ATTGTTGGTT TAAAAGATAT TCCGAAAGAA GATCGTCCAA ATGTTAACCT TGTGTACTAT
AGCTTCCGCT TAATGGTTGC ACTTGGTATG TTCTTTATGG CATTAACTTG GTACGGTTTC
TACTTAAACC GAAAAGGAAA ACTGGAAAAC TCAAAACGTT ACTTAAAAAT TACAATATGG
TCTGTTTTAC TACCATATAT CGCAATTAAC GCTGGTTGGA TTGTTGCCGA AGTAGGTCGT
CAACCATGGA CAGTTTATAA ACTAATGCGT ACAGCAGAAT CTGTATCACC TATATCTGTC
CCGCAAATTT GGTTCTCTTT AATTAGTTTA ATCTTGTTCT ATACTTTACT TTTAATCGCA
GACGTATATT TAATGCTGAA GTTCGCGAAA AAAGGACCAG CAGCATTAGA AGAACCTGCT
ACTAAGGGAG GCGTGGCTCA TGTCTCATGA
 
Protein sequence
MSDVLLLSRF QFAITIFYHF LFVPLTIGLV ILVACMETQY ARTLNPTYRK MANFWGKLFT 
INFVMGIITG ITMEFQFGTN WSEYSKYMGD IFGSPLAIEA LVAFFLESTF MGIWLFGKDK
ISPKFRAFCM WMVALGTNIS ALWIITANGF MQNPVGYVVR NGRAELNDFW ALVTNPYAWN
MFFHTVIGCY IVGAFFVMAI SAYHLLRKNE VEFFKKSFKF GLMLGLFAAT ITPFIGHQSG
VSAAKYQPAK GAAMEAVWET GKGQGFSIVQ IPDVKNEKNF EFLTIPKLGS FFYTNSFDGE
IVGLKDIPKE DRPNVNLVYY SFRLMVALGM FFMALTWYGF YLNRKGKLEN SKRYLKITIW
SVLLPYIAIN AGWIVAEVGR QPWTVYKLMR TAESVSPISV PQIWFSLISL ILFYTLLLIA
DVYLMLKFAK KGPAALEEPA TKGGVAHVS
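
The relationship between the gene sequence, the GC content field and the protein sequence can be sketched in Python as follows. This is a minimal illustration, not the database's own pipeline: the helper names (clean, gc_content, translate) are hypothetical, the sequence is assumed to be pasted in the space-separated blocks of ten shown above, and the codon table is the amino-acid assignment of NCBI translation table 11. Alternative start codons, which table 11 reads as methionine at initiation, are not handled; this gene starts with ATG, so that does not matter here.

```python
BASES = "TCAG"
# Amino-acid assignments of NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
first_blocks = "ATGTCCGACG TTCTGTTACT GAGTCGTTTT"
print(translate(first_blocks))  # MSDVLLLSRF
```

The output matches the first block of the protein sequence. Passing the full gene sequence to gc_content and translate would allow the GC content and Protein Length fields to be checked in the same way.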