Gene BAS4191 details

Gene Information

Locus tag: BAS4191
Symbol:
ID: 2852180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4105986
End bp: 4107107
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 38%
IMG OID: 637507427
Product: hypothetical protein
Protein accession: YP_030439
Protein GI: 49187187
COG category: [S] Function unknown
COG ID: [COG3323] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family
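
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4105986
end_bp = 4107107
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions, the three fields (coordinates, gene length, protein length) are mutually consistent.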


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000743156
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAA TTCCAAATGG CCATGAAATT ATTTCTTTAT TTGAAAGTAT GTATCCGAAG 
CATTTGGCGA TGGAAGGAGA TAAGATTGGC CTGCAGATTG GAGCGCTTAA TAAACCCGTG
CAGCACGTAT TAATTGCGTT AGATGTAACG GAAGAAGTTG TGGATGAAGC AATTCAATTA
GGAGCGAATG TCATTATTGC GCATCATCCT TTAATTTTTA ACCCGCTAAA AGCGATTCAT
ACAGATAAGG CGTATGGGAA AATTATTGAA AAGTGTATTA AAAATGATAT TGCAATCTAT
GCAGCACATA CAAATGTGGA TGTTGCTAAG GGCGGGGTAA ATGATTTACT TGCTGAGGCG
TTAGGATTGC AAAATACAGA AGTTTTGGCA CCGACATATG CAGAAGAAAT GAAAAAAATT
GTTGTGTTTG TGCCTGAAAC TCATGCAGAA GAAGTAAGAA AAGCATTAGG AGACGCAGGC
GCTGGTCATA TCGGCAATTA TAGCCACTGT ACGTTTAGTA GCGAGGGTAC AGGCGCGTTT
ATACCTCAAG AGGGAACAAA TCCTTATATC GGGGAAACTG GGCAGTTAGA ACGCGTGGAA
GAAGTGCGAA TCGAAACGAT TATTCCAGCT TCATTACAGC GAAAAGTAAT TAAAGCAATG
GTAACGGCAC ATCCATATGA AGAAGTAGCA TATGATGTGT ATCCACTTGA TAACAAAGGT
GAAACATTAG GGCTTGGAAA AATAGGATAT TTACAAGAAG AAATGACACT TGGACAATTT
GCGGAACATG TAAAGAAGTC ATTAGATGTA AAGGGTGCGC GAGTTGTTGG GAAATTAGAT
GATAAAGTGC GCAAAGTAGC TGTACTTGGT GGCGATGGTA ACAAATACAT CAATCAAGCT
AAATTTAAAG GAGCAGATGT ATATGTAACG GGGGACATGT ATTATCATGT TGCTCATGAT
GCGATGATGC TCGGTTTAAA TATAGTTGAC CCAGGACATA ACGTTGAAAA GGTAATGAAG
CAAGGTGTAC AAAAGCAATT ACAAGAAAAA GTGGATGCAA AGAAACTTAA TGTAAACATT
CATGCTTCGC AGTTACATAC AGATCCATTT ACATTTGTAT AA
 
Protein sequence
MSKIPNGHEI ISLFESMYPK HLAMEGDKIG LQIGALNKPV QHVLIALDVT EEVVDEAIQL 
GANVIIAHHP LIFNPLKAIH TDKAYGKIIE KCIKNDIAIY AAHTNVDVAK GGVNDLLAEA
LGLQNTEVLA PTYAEEMKKI VVFVPETHAE EVRKALGDAG AGHIGNYSHC TFSSEGTGAF
IPQEGTNPYI GETGQLERVE EVRIETIIPA SLQRKVIKAM VTAHPYEEVA YDVYPLDNKG
ETLGLGKIGY LQEEMTLGQF AEHVKKSLDV KGARVVGKLD DKVRKVAVLG GDGNKYINQA
KFKGADVYVT GDMYYHVAHD AMMLGLNIVD PGHNVEKVMK QGVQKQLQEK VDAKKLNVNI
HASQLHTDPF TFV
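
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few blocks by hand. The sketch below is a minimal illustration, not the method used to produce this record. It assumes that the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons). The helper name `translate` is hypothetical. Only the first and last 60-column blocks of the gene sequence are used.

```python
# Codon table in the NCBI layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last blocks of the gene sequence, copied from this record.
# The last block starts at base 1081, which is in frame (1080 is divisible by 3).
first_block = "ATGAGTAAAA TTCCAAATGG CCATGAAATT ATTTCTTTAT TTGAAAGTAT GTATCCGAAG"
last_block = "CATGCTTCGC AGTTACATAC AGATCCATTT ACATTTGTAT AA"

print(translate(first_block))
print(translate(last_block))
```

The two printed strings can be compared against the start and end of the protein sequence listed above, ignoring the spaces between 10-residue groups.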