Gene Apre_0087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0087 
Symbol 
ID8396838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp107793 
End bp108902 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content35% 
IMG OID644994426 
Producthistidine kinase 
Protein accessionYP_003151861 
Protein GI257065605 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00115186 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA AGAAATCCTA CGACTCTATA GTAACAAGGC TAATAATTTC CATTGTAATT 
ATATTTGTCC TAATGGTTCT CGTTTCGAAT TTTCTTATAA ATAGGAAACA AATTATGATA
ATGGAGGAGG TCTTTGAGGG CTTTTCTAGA AACTTTCCAG AAAATAACGA GTCGGTAGTC
CTTCTTATAG ATAGTGCCCA GGTCCAATCC ACAAGTGCTT TTAGGCTGTA TCTCGGCTTG
GTTGTAGCTT CAGTCGTTCT GATTGGATCT TTGACCTTTG TCTTTATAAT CAAAAGGACC
CTAAAACCCC TAAACAAACT AGAAGAGAAG ATAGGAAGGG TGGATATAGA AAATCCCGAT
AGTTTCTCGG AAAATCTAGT CTTGGTTGAG GGGCCTACGG AGATCAAGGA GCTTTCCAAG
AAGTTTGATG ATTTAATCCA AAGGATTTAT AAGGATTACA AGAAGCAAAA GGAATTCTCA
TCAAATGTGG CCCATGAGCT AAGGACACCC ATAGCTATAA TGCAGGCCCA GGTCGATGTT
TTTAGAGAGA AGAATACTGA CGAAAATAAT CTAGACTTTA TTGAAACAAT GGATTCCAAT
CTTAAGAGGC TCAAAAATCT TATCGATTCG GTCCTTCTTT TAAGTAAGAG AAATAAATTA
AAGATTAGCT CTGTTAATCT TGATAATATG ATAGATGAGA TTTTGTTTGA CCTAGATGAT
TTTGCCTCTA AGAAAAATAT TAGCCTAGAC TATCACTATT CTAATATAAG CATTGATTCG
GATGATGTCC TAATCCAAAG GCTTATCTTT AATATAGTAG AAAATGCCAT CAAATATACC
GAAGAGGGAG GCTTAGTTGA TGTTAATGTG AGTCAAAATG ATAAGGAGAC TGTAATAAGA
ATATCCGATA CTGGAATTGG GATTAGTGAT GAGAAAAAGG AGGCAATATT TGATCTTTTC
TATCAAGTAG ATGACTCAAG AAACAAAGAA GGCTTTGGTA TAGGCCTTTC TCTATCTAAA
GATATAGCCG AAACTTTGGG GGCGAGGATA GAAGTAAGAG ACAATAAGCC TAAGGGAACA
ATATTTTTGA TAAAATTTAG AAATATTTAA
 
Protein sequence
MKNKKSYDSI VTRLIISIVI IFVLMVLVSN FLINRKQIMI MEEVFEGFSR NFPENNESVV 
LLIDSAQVQS TSAFRLYLGL VVASVVLIGS LTFVFIIKRT LKPLNKLEEK IGRVDIENPD
SFSENLVLVE GPTEIKELSK KFDDLIQRIY KDYKKQKEFS SNVAHELRTP IAIMQAQVDV
FREKNTDENN LDFIETMDSN LKRLKNLIDS VLLLSKRNKL KISSVNLDNM IDEILFDLDD
FASKKNISLD YHYSNISIDS DDVLIQRLIF NIVENAIKYT EEGGLVDVNV SQNDKETVIR
ISDTGIGISD EKKEAIFDLF YQVDDSRNKE GFGIGLSLSK DIAETLGARI EVRDNKPKGT
IFLIKFRNI