Gene BAS1874 details


Gene Information

Locus tag: BAS1874
Symbol:
ID: 2851357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 1896069
End bp: 1897070
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 38%
IMG OID: 637505125
Product: luciferase family protein
Protein accession: YP_028138
Protein GI: 49184886
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03558] luciferase family oxidoreductase, group 1
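
The coordinates and lengths in this record are mutually consistent, which a short Python sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the CDS length counts the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 1896069
end_bp = 1897070

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1002 333
```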


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAAAT TAAGTGTATT AGACCAATCC CCTATTTCAG ATGGTAGTAC AGCAACCCAA 
GCTTTTTCAC ATACAGTTAC GCTTGCGCAA GAAGTTGAAA AACTCGGATA TACACGCTTT
TGGGTATCAG AGCATCACAA TTCTGTAAGC CTTGCTGGTT CAAGTCCAGA AATACTTATT
TCTCATATTG CAGCAAAAAC GGAGCGTATG AGAGTTGGTT CGGGTGGTGT TATGTTACCT
CACTATAGCC CGTATAAAGT TGCTGAGAAT TTCCGTGTTT TAGAAGCGCT TTATCCAAAT
CGTATTGACC TTGGTGTCGG CAGAGCACCT GGCGGTATGC CAATTGCAAC TCGCGCACTT
CAAGAAGGAA AAATGGTCTC ACTTGATCAA TATCCAGAAC AAATTGCAGA TGTTGCAATG
TATTTACATG ATCAAGTACC AGAAAACCAT CATTATGCGA ATTTAAAAGC TACTCCTGTG
ATTCCAACAT CTCCCGACAT GTGGTTGCTT GGCTCTAGTG GAGAGAGTGC AAAGATCGCT
GCACAGCAAG GTGCTTCTTT TGCATTCGCA CAGTTCATAA ACGGATACGG TGGACCCGAG
GTAATGGAGG CCTATCAAAA ACAATTCCAA CCTTCCTATT TAGGAGATAA ACCAAAATCA
ATCGTCGCTA TTTTTGTTAT TTGCGGAGAA ACAAATGAAG AGGCGGAAAA AATTGCTTCA
AGTTTAGATT TATCAATCTT ATTACTAGAA CAAGGAAAGC GTACAACTGG TACTCCTTCT
ATCGAAACTG CTCAAAATTA TTCATATAGC ACGTATGATT TATTCCGTAT AAAAGAAAAT
CGTCAACGTA TGATAGTAGG CGACCCGTCT TCTGTAAAAG AAAAAATCAT AAACTTAAGC
AAAGCGTACA ATACTGAAGA ATTTATGATT GTCACAATTA CTCATCGATT TGAAGATAAA
TTAAATTCTT ATCGCTTATT GGCAAATGCT TTTAATTTAT AA
 
Protein sequence
MIKLSVLDQS PISDGSTATQ AFSHTVTLAQ EVEKLGYTRF WVSEHHNSVS LAGSSPEILI 
SHIAAKTERM RVGSGGVMLP HYSPYKVAEN FRVLEALYPN RIDLGVGRAP GGMPIATRAL
QEGKMVSLDQ YPEQIADVAM YLHDQVPENH HYANLKATPV IPTSPDMWLL GSSGESAKIA
AQQGASFAFA QFINGYGGPE VMEAYQKQFQ PSYLGDKPKS IVAIFVICGE TNEEAEKIAS
SLDLSILLLE QGKRTTGTPS IETAQNYSYS TYDLFRIKEN RQRMIVGDPS SVKEKIINLS
KAYNTEEFMI VTITHRFEDK LNSYRLLANA FNL
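
The relationship between the gene sequence and the protein sequence can be sketched in Python as follows. This is a minimal illustration, not the tool that produced the record: `translate` and `CODON_TABLE` are hypothetical names, and the sketch relies on the fact that translation table 11 assigns amino acids exactly as the standard code does, differing only in which codons may serve as start codons.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code;
# it differs only in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGATCAAAT TAAGTGTATT AGACCAATCC"))  # prints: MIKLSVLDQS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence, with or without the block spacing, would be handled the same way.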