Gene GBAA_pXO2_0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_pXO2_0066 
SymbolcapB 
ID2820410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007323 
Strand
Start bp55599 
End bp56993 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content37% 
IMG OID637682943 
Productcapsule biosynthesis protein Capb 
Protein accessionYP_016576 
Protein GI47566735 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACA TAAAAATTGT AAGAATATTG AAACATGATG AGGCAATACG CATTGAACAT 
AGGATTTCAG AATTATACTC AGATGAATTC GGTGTTGTAT ATGCAGGGAA CCACCTAATT
TTTAATTGGT ATCAACGACT CTACTTAAGT CGAAATATCT TAATAAGCAA GAAATCGAAA
AGCAGGAAGG GATTAATACA GATGATCTTC ATAATAGGTA TATGTACAGT GTTTTTGATT
ATTTATGGTA TATGGGAACA ACGTTGCCAT CAGAAAAGGC TCAATTCTAT CCCAATTCGA
GTAAACATAA ATGGAATTCG AGGTAAATCT ACCGTTACAA GACTAATTAC AGGTGTTGTA
CAAGAAGCGA AATATAAGAC TGTAGGGAAA ACAACTGGTA CATCTGCGCG AATGATATAT
TGGTTTACTG ACGAGGAGCA ACCGATTAAG CGCCGTAAAG AAGGTCCTAA TATCGGTGAG
CAACGCAGGG TAGTTAAAGA GGCTGCTGAT TTAGAAGCAG AAGCACTTAT TTGTGAATGT
ATGGCAGTTC AACCCGATTA TCAAATTATC TTCCAAAATA AAATGATTCA AGCAAATGTT
GGAGTGATTG TAAATGTTTT AGAAGATCAT ATGGATGTTA TGGGACCTAC ACTTGACGAA
GTAGCTGAAG CTTTCACTGC TACCATTCCA TATAATGGAC ATTTAGTCAC TATTGAAAGT
GAATACTTGG ATTACTTTAA AGAGGTTGCA GAAGAGAGAA ATACAAAAGT GATTGTTGCG
GATAATTCTA GAATTTCAGA AGAATTCTTA CGAAAATTTG ATTACATGGT CTTCCCAGAT
AATGCATCGC TTGCTTTAGC GGTAGCAGAG GCTCTTGGGA TTGATGAGGA AACAGCATTC
CGTGGTATGT TGAATGCTCA TCCGGATCCA GGAGCAATGA GAATTACACG TTTTGCTGAC
CAATCTAAGC CTGCGTTCTT CGTAAATGGT TTTGCAGCGA ATGATCCCTC ATCAACATTA
CGTATTTGGG AACGTGTGGA TGATTTTGGA TATAGTAATC TAGCTCCAAT TGTAATTATG
AATTGCCGCC CTGACCGCGT TGATCGTACT GAGCAGTTTG CTAGGGATGT TTTGCCATAT
ATTAAAGCGG AAATAGTTAT TGCGATTGGA GAAACGACTG CACCTATTAC AAGTGCTTTT
GAAAAAGGAG ATATTCCAAC GCAAGAGTAT TGGAACTTAG AAGGCTGGTC AACAAGTGAA
ATTATGTCTC GTATGCGTCC ATATTTAAAA AATCGGATTG TATATGGAGT GGGTAATATT
CATGGTGCAG CTGAGCCATT AATCGATATG ATTATGGAAG AACAAATTGG CAAAAAGCAA
GCAAAAGTGA TTTAA
 
Protein sequence
MKNIKIVRIL KHDEAIRIEH RISELYSDEF GVVYAGNHLI FNWYQRLYLS RNILISKKSK 
SRKGLIQMIF IIGICTVFLI IYGIWEQRCH QKRLNSIPIR VNINGIRGKS TVTRLITGVV
QEAKYKTVGK TTGTSARMIY WFTDEEQPIK RRKEGPNIGE QRRVVKEAAD LEAEALICEC
MAVQPDYQII FQNKMIQANV GVIVNVLEDH MDVMGPTLDE VAEAFTATIP YNGHLVTIES
EYLDYFKEVA EERNTKVIVA DNSRISEEFL RKFDYMVFPD NASLALAVAE ALGIDEETAF
RGMLNAHPDP GAMRITRFAD QSKPAFFVNG FAANDPSSTL RIWERVDDFG YSNLAPIVIM
NCRPDRVDRT EQFARDVLPY IKAEIVIAIG ETTAPITSAF EKGDIPTQEY WNLEGWSTSE
IMSRMRPYLK NRIVYGVGNI HGAAEPLIDM IMEEQIGKKQ AKVI