Gene GBAA_3841 details


Gene Information

Locus tag: GBAA_3841
Symbol:
ID: 2815459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 3514729
End bp: 3516165
Gene length: 1437 bp
Protein length: 478 aa
Translation table: 11
GC content: 45%
IMG OID: 637790563
Product: hypothetical protein
Protein accession: YP_020476
Protein GI: 47529128
COG category:
COG ID:
TIGRFAM ID:
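
The coordinates and lengths listed above agree with one another, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
START_BP = 3514729
END_BP = 3516165


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the untranslated stop codon


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1437 478
```

Both results match the Gene length and Protein length fields, which supports the inclusive-coordinate assumption.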


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000174288
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCGTT ATGACGACAG TCAAAACAAA TTCTCCAAAC CATGCTTTCC AAGTAGCGCT 
GGACGAATCC CGAATACTCC ATCAATCCCA GTTACTAAGG CACAACTTAG AACATTTCGC
GCAATCATTA TTGATTTAAC AAAAATAATC CCAAAACTTT TCGCAAATCC ATCTCCCCAA
AATATTGAAG ATCTAATCGA TACATTGAAC CTACTAAGTA AATTTATTTG TTCACTAGAC
GCTGCTTCCT CCCTGAAAGC ACAAGGATTA GCTATTATTA AAAACTTAAT AACTATATTA
AAAAACCCAA CTTTCGTAGC AAGTGCTGTA TTTATCGAGC TTCAAAATCT AATTAATTAT
TTACTATCCA TTACAAAACT ATTCCGAATT GACCCTTGCA CACTTCAAGA GCTTCTTAAA
TTAATAGCAG CATTACAAAC CGCTTTAGTT AATTCTGCTT CATTCATTCA AGGACCTACT
GGACCTACTG GACCTACTGG GCCAGCTGGT GCTACCGGTG CTACTGGACC TCAAGGTGTT
CAAGGACCAG CAGGCGCTAC CGGTGCCACT GGACCTCAAG GTGTTCAAGG ACCAGCAGGT
GCTACTGGCG CTACTGGACC TCAAGGTGCT CAAGGACCAG CAGGTGCTAC CGGTGCTACT
GGACCTCAAG GTGCTCAAGG ACCAGCAGGT GCTACTGGTG CCACTGGACC TCAAGGTATT
CAAGGACCAG CAGGTGCTAC CGGTGCTACT GGACCTCAAG GCGTTCAAGG GCCAACGGGT
GCTACTGGTA TAGGAGTTAC CGGACCTACT GGGCCTTCTG GTGGGCCTGC TGGTGCTACT
GGACCTCAGG GACCTCAAGG TAATACAGGT GCTACTGGAC CTCAAGGTAT TCAAGGGCCT
GCTGGTGCTA CTGGTGCCAC TGGACCTCAA GGTGCTCAAG GACCGGCTGG TGCTACCGGC
GCTACTGGAC CTCAAGGTGT TCAAGGGCCA ACGGGTGCTA CTGGTATAGG AGTTACCGGA
CCTACTGGGC CTTCTGGACC TAGCTTCCCT GTAGCAACAA TTGTTGTAAC AAACAACATT
CAACAAACAG TACTCCAATT TAACAACTTC ATTTTTAATA CTGCAATTAA CGTAAACAAC
ATTATCTTCA ACGGCACAGA TACAGTTACT GTTATCAACG CTGGTATTTA TGTCATTAGC
GTATCCATCT CTACAACTGC ACCAGGATGT GCACCACTCG GAGTAGGAAT TTCAATAAAT
GGAGCAGTCG CAACTGACAA CTTCTCTTCA AATCTAATAG GCGACTCACT TTCATTCACT
ACGATCGAAA CGTTAACTGC CGGCGCGAAC ATTTCTGTCC AATCCACTCT TAATGAGATT
ACGATCCCTG CAACAGGAAA CACTAATATT CGTCTAACTG TATTTAGAAT CGCTTAA
 
Protein sequence
MSRYDDSQNK FSKPCFPSSA GRIPNTPSIP VTKAQLRTFR AIIIDLTKII PKLFANPSPQ 
NIEDLIDTLN LLSKFICSLD AASSLKAQGL AIIKNLITIL KNPTFVASAV FIELQNLINY
LLSITKLFRI DPCTLQELLK LIAALQTALV NSASFIQGPT GPTGPTGPAG ATGATGPQGV
QGPAGATGAT GPQGVQGPAG ATGATGPQGA QGPAGATGAT GPQGAQGPAG ATGATGPQGI
QGPAGATGAT GPQGVQGPTG ATGIGVTGPT GPSGGPAGAT GPQGPQGNTG ATGPQGIQGP
AGATGATGPQ GAQGPAGATG ATGPQGVQGP TGATGIGVTG PTGPSGPSFP VATIVVTNNI
QQTVLQFNNF IFNTAINVNN IIFNGTDTVT VINAGIYVIS VSISTTAPGC APLGVGISIN
GAVATDNFSS NLIGDSLSFT TIETLTAGAN ISVQSTLNEI TIPATGNTNI RLTVFRIA
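
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below shows one way to do this with a hand-built codon table. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as initiators), and because this gene starts with ATG, the standard assignments should be enough here. The names `CODON_TABLE` and `translate` are illustrative, and only the first block of the gene sequence is used as input.

```python
# Standard codon assignments in the compact TCAG ordering
# (shared by translation table 11 for internal codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
first_block = "ATGTCTCGTT ATGACGACAG TCAAAACAAA"
print(translate(first_block))  # MSRYDDSQNK
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should yield the complete protein, since the final TAA codon ends translation.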