Gene BAS3071 details


Gene Information

Locus tag: BAS3071
Symbol:
ID: 2848884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3051891
End bp: 3052919
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 34%
IMG OID: 637506315
Product: hydrogenase expression/formation protein HypE
Protein accession: YP_029328
Protein GI: 49186076
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0309] Hydrogenase maturation factor
TIGRFAM ID: [TIGR02124] hydrogenase expression/formation protein HypE
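
The coordinate and length fields above are related by simple arithmetic. The following Python sketch shows that relationship. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(3051891, 3052919)
print(length)                     # 1029
print(protein_length_aa(length))  # 342
```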


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.316451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAA CTAAAGATTT AGAAAATATA ATTTTAGATC ATGGAACAGG AGGACTATTG 
AGTCAGGATT TAATTAGTTC AATTATTACT GCAAAACTAG AAGATGTTCA CCTTGGAAAA
ATGGAGGATA GTGCCATTCT TGAGGTAAGT AGTAGAAGAT TGGCGATGAC AACAGATTCT
TTTGTTATTG ATCCTATCTT TTTTGGTGAG GGGAATATAG GTAAAGTAGC AGTTTGTGGA
ACGGTTAATG ATTTAGCGGT TAGTGGTGCA AAACCACTTT ATCTATCATT GGCATTAGTA
CTAGAAGAGG GTTTTCCAAT TAAAGATTTG GAAGAAATAT TAGATTCTAT AAGAGAGACC
GCGAAAGAAG CTGGAGTGTA TATTGTTGCT GGTGATACAA AAGTTGTTAA AAAAGGTGAA
GTAGACAAAA TCTTTATTAA TACAACTGGA ATAGGAGTTT TTGAAGGAGA TGTAGCACCG
TTTTCGGTAA ACTCTATTCA AGAAGGTGAC GATATCATTA TAACAGGACA ACTAGGAAAC
CATAGTATAC ATATCCTTTC TATCAGAGAA GGATTAGGGT TTGAACAGAG AATAAATAGT
GATTGTGCAC CATTAAATCA TATGATTACA GAGTTAAAAA ATCATTTTGG TGATTCTATA
CATTGTATGC GAGATATTAC AAGAGGCGGG TTAGGTACAG TTTTAAATGA AGTCTCAGAA
ACAATTAATA CTGGAATAAA AATACAAGAA AAAGATATTC CTATGTTAGC AGAAACTATT
ATGGCTGCCG ACATGTTAGG TGTTAATCCA ATGTATCTAG CTAATGAGGG CAATGTTTGT
ATGTTTGTGT CTCCAGAGGT AAGTGAGGAA GTCGTGAGGG TATTAAAAAA TACTAAATAT
GGTAAAGAAG CTGCGGTAAT TGGTAAAGTT ACTCAAACAA AAGAAAGACA AGTACTCATG
GAAGCAAAAT CAGGTGAATT GAAACTCATT GAGTTATTAT ATGGGGCAGA ATTACCTCGA
TTATGTTAG
 
Protein sequence
MKLTKDLENI ILDHGTGGLL SQDLISSIIT AKLEDVHLGK MEDSAILEVS SRRLAMTTDS 
FVIDPIFFGE GNIGKVAVCG TVNDLAVSGA KPLYLSLALV LEEGFPIKDL EEILDSIRET
AKEAGVYIVA GDTKVVKKGE VDKIFINTTG IGVFEGDVAP FSVNSIQEGD DIIITGQLGN
HSIHILSIRE GLGFEQRINS DCAPLNHMIT ELKNHFGDSI HCMRDITRGG LGTVLNEVSE
TINTGIKIQE KDIPMLAETI MAADMLGVNP MYLANEGNVC MFVSPEVSEE VVRVLKNTKY
GKEAAVIGKV TQTKERQVLM EAKSGELKLI ELLYGAELPR LC
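
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal, dependency-free Python example, not the database's own method. It builds the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Spaces and line breaks in the 10-base blocks are stripped before processing.

```python
# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above.
first_row = "ATGAAATTAA CTAAAGATTT AGAAAATATA ATTTTAGATC ATGGAACAGG AGGACTATTG"
print(translate(first_row))  # MKLTKDLENIILDHGTGGLL
```

Passing the full gene sequence to `translate` allows a comparison with the protein sequence listed above, and `gc_fraction` applied to the full gene sequence can be compared with the GC content reported in the Gene Information section.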