Gene BAS4668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4668 
Symbol 
ID2850811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4561298 
End bp4562518 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content32% 
IMG OID637507902 
Productsensor histidine kinase 
Protein accessionYP_030912 
Protein GI49187659 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.304477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAG ACGGTATATT TAAAAATGAA GAAATGAAGG CGCTAAAAAT ATTTTTAAGT 
TTATTTTTCA TTATATTTTT TGTATACGAT CTTGCCTACG AATTTATTGT ACCTTTAATA
GGAGGAGAGC AAGAAGGAGT AGGACAATTT GAAGATGGTT TAGGTTTATG GCTTTATTTT
CTGATGGTGG TCCTATTTTG CACTGGAATA TACTTTATGA AATGGAAGAA TCCATTTGCA
GTGAAATATA TTATACTAAT TGGATATAAT CTATTAGATT TTATCCATAA TTTTATGATT
TACTATGGTA GTGATGCTGA GTTTGATGGT GGGAATATAG TAGAAGGATT TTTTATTTTA
TTTGCCCCAC TCTTTGTGAA TAAAAGATAT TTTTGGTTAG TTCCGAGCAT ACTTATTGGA
AAATACGCTC TTACTGGAAT CATCATTCAC TCGTCACTCG TTTTAATCCC GATGGCATTA
TATGGCGTAT TTACTATCAT ATGTTGGATT ATGTTTTTAA GATTTCACTC TTACGTTCGC
ACGCTTGAGA TGATGGATAA AGAAATACAA CAAACAGAAA AGCTAGCAAC TGTTGGGAAA
ATGGCTACAG TTATTGGTTA CAAAATTAAA AGACCTTTAG CTAATTTAGA TAAATTTGTT
AATAAGCAAG CGATTAAATA TCCAGAGGAC AAAATATATA GTGATATTAT GAAACAAGAA
GTAGAACGAA TTCATATAAT AGCTACAGAA CTTAGTGGAT TTGAGAAATC TAAATCAATA
GAATCAGAAG TTCATAATAT AGAAGAAATT ATCGCTTATG TTATTCGAGT TATGGGGAAG
CCTGCATTAA ATCAAGGCGT GCACATACAA GGTATTTATA GTAAAGACAT ACCATCGATT
ACATGCGATG AAAAACGATT AAAACAAGTA TTTTTTAATT TAATTAAAAA TGCGATTGAA
GCAATGTCAG TTGGCGGAAC GATTACAATT AAAGTGACTG TAGAAGATGC AATCATTATT
CAAGTGAAGG ATGAGGGTTG CGGCATTCCA AAAGAAAAAA TTCCTAAGTT AAACGAAGCC
TTTTACACAA CGAAAGAAAC GGGAACAGGT TTAGGTTTAG TAGTTACAGA AAAAATTATT
AAAGATCACA ATGGTAAAAT GAGTTTTGAA AGTGAAGTTG GGGTTGGAAC GACGGTGAAG
GTTATGTTGC CGATACAATA A
 
Protein sequence
MNKDGIFKNE EMKALKIFLS LFFIIFFVYD LAYEFIVPLI GGEQEGVGQF EDGLGLWLYF 
LMVVLFCTGI YFMKWKNPFA VKYIILIGYN LLDFIHNFMI YYGSDAEFDG GNIVEGFFIL
FAPLFVNKRY FWLVPSILIG KYALTGIIIH SSLVLIPMAL YGVFTIICWI MFLRFHSYVR
TLEMMDKEIQ QTEKLATVGK MATVIGYKIK RPLANLDKFV NKQAIKYPED KIYSDIMKQE
VERIHIIATE LSGFEKSKSI ESEVHNIEEI IAYVIRVMGK PALNQGVHIQ GIYSKDIPSI
TCDEKRLKQV FFNLIKNAIE AMSVGGTITI KVTVEDAIII QVKDEGCGIP KEKIPKLNEA
FYTTKETGTG LGLVVTEKII KDHNGKMSFE SEVGVGTTVK VMLPIQ