Gene Apar_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1045 
Symbol 
ID8413918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1181195 
End bp1182568 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content43% 
IMG OID645022634 
Producthistidine kinase 
Protein accessionYP_003180064 
Protein GI257784847 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCAG AGAAATCTAA TAAAGCAACT CCACAAGAGT ATAAAAAACC CTTCTTTGCT 
CACTCCGATT CCTCAACTGC AGGTGTTATT ACCTGGGGAT TTTGGTGGCG TAAGCTAATA
AATTACATTG GCTTCAATTT CTTTCTATTA GTGAATATTA CGCTTGTTTA TATTTATATG
TATAACCAGC ACCTGCCACA GGGCACTTTT TACCTGGGAT TTTTTCCCAT CGAGTCTAAT
TCTATTTCTC TTACTGGTTT CTCTTTCTTG CACGGACTCT CAAGCCTCAA ATACATTGTT
CACGGAATAA CCTTTGGTGC AAAAATATTT GATTTAGGTG CGGACTTAAC TCGTTTCTGG
CCAGCATATC TTGCTATTCT CATCTGGGAA TTTATTGACA TGCTGCATTT TTTTAGCGAT
ATGCGCCGTG TCAGAAGAGC TCTTCAACCC CTCAATACAC TTGCTCTTAA AACAGAACAA
TTGATTAATA GTGATGTACT AGCAACTAAC ACTACAGCCA CTAATGACAT CCTAGTTAAG
AAGGATAAAA TGAGGAGTCT TGAACAGGCT ATTGAAGAAG CCAATGTCAA CTCTCCAAAG
ATTCAAACAG GCGACCAAGA CCTTGCAAGT ATTGAAATTG CTTTGAATAA GCTGCTTCGC
CGTATGCAAG AAGCAAAGTT GCAACAAATG CGCTTTGTCA ACGATGCTAG TCACGAACTC
CGCACACCTA TAGCTGTTAT TCGAGGTTAT ACCGACATGC TAGATCGCTG GGGTAAAACA
GACGAAGCGG TACTTGACGA ATCCATTACT GCACTCAAAT CTGAAAGTCA GCACATGCAT
GACCTGGTTG AACAGCTCCT CTTCTTAGCA CGTGGAGACG CAGGAAGAAA TACCCTCACA
AAGATCCAGC TCAATCTTGC GCAGATAGCT TCTGAGGTCT GGGAAGAATC GGAGATGATT
GACCCTGACC ACCGCTATGC TCTGAAGTTT GATCAAAGTG CGCTGTCAGA TGACCACTAC
CAAGTACTTG CCGATACTGC CATGATTAAG CAATCTATCC GTATTATCGT GCAAAACGCT
GCAAGATATT CTGCTGCCCA AACTACCATT TCTTTTAACG TCACATATGA CGAGAAAACC
GTTCAAGTTT CAATTGAGGA CGAGGGTATG GGTATATCGG AGGCTGCTGC TGCTCATATT
TTTGAGAGGT TCTGGAGAGC TGACAACGCC CGCATTGAGA GCAACGAAGG TTCTGGACTT
GGCTTATCCA TAGCAAAATG GATTGTCGAC AACCATGATG GTTCTATTAA AGTGGTTTCA
CGCGAGGGCG TAGGCACGCG CTTTACTATC GTTCTACCAC ACAAAGTTTC ATAG
 
Protein sequence
MSSEKSNKAT PQEYKKPFFA HSDSSTAGVI TWGFWWRKLI NYIGFNFFLL VNITLVYIYM 
YNQHLPQGTF YLGFFPIESN SISLTGFSFL HGLSSLKYIV HGITFGAKIF DLGADLTRFW
PAYLAILIWE FIDMLHFFSD MRRVRRALQP LNTLALKTEQ LINSDVLATN TTATNDILVK
KDKMRSLEQA IEEANVNSPK IQTGDQDLAS IEIALNKLLR RMQEAKLQQM RFVNDASHEL
RTPIAVIRGY TDMLDRWGKT DEAVLDESIT ALKSESQHMH DLVEQLLFLA RGDAGRNTLT
KIQLNLAQIA SEVWEESEMI DPDHRYALKF DQSALSDDHY QVLADTAMIK QSIRIIVQNA
ARYSAAQTTI SFNVTYDEKT VQVSIEDEGM GISEAAAAHI FERFWRADNA RIESNEGSGL
GLSIAKWIVD NHDGSIKVVS REGVGTRFTI VLPHKVS