Gene Apar_1235 details

Gene Information

Locus tag: Apar_1235
Symbol:
ID: 8414114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1387367
End bp: 1388671
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 45%
IMG OID: 645022828
Product: histidine kinase
Protein accession: YP_003180252
Protein GI: 257785035
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
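
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete with a single terminal stop codon. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 1387367
END_BP = 1388671
GENE_LENGTH_BP = 1305
PROTEIN_LENGTH_AA = 434


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold: 1388671 - 1387367 + 1 = 1305 bp, and 1305 / 3 - 1 = 434 aa.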


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.14404
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGTCGC TTGTTCTGCT GTTCACGGTG ACGCTGTCGG TCATCATGTT TGCTAGCACT 
CAGGAGATCA GACAGAAGAA CATGGACGTG CTCAACCGCT ACGCCAGTCA GTATTCGCTT
GAGAAGGAAA AAGGTAGCTC TGAGGGACAG AGTGGCCCAG AGGGTCAGGG TAGTTCTGAG
GGGCAGGGTA GCTCCGAGGG ACAGACAAGC TCCGAAGATC AGAGTCCGCA GCAGCCACTG
GTCAAACCTG ACGCTCAGCT GTCAAACAAA TCTGACAATC AACCGCCAGG TCAGAGATCT
GCTTACGAGC TCTCAACGTT TTACTCAGTA TCGTTTTCTG TAGACGGCTC CGTACTTTCA
GTCTTTAATG GCGAGAAAAC CGTTAGCTCT GATGAAAACC TTACTGAGTT TGCTCGTCAG
ATTTTGAACG AAGGAAATCC TTCTGGTAGA ACAGGCAATC TTTCTTACGT GGTTATGAAA
AAAGATGGCT ATACGCTTGT GGCGTTTATG GATAACACCG TTTCTGAAGC CGGTCTTCAG
ACCATGATGC GAAACGCTCT GCTTGTAGGA GGCGTATCGC TGGTAGGTAT GTTCTTTATT
TCTGTGTTTC TGGCAAAGCG CATTATTCGT CCACTTGAAG AGAGTGATAA AAAGCAGAAG
CAGTTCTTAT CCGACGCAAG TCACGAGCTC AAGACCCCTA TTGCGGTTAT TGACGCCAAT
GCAGAGATTC TATCCAGAGA ACTTAGTCAC AACGAATGGC TCTCCAACAT TCAATACGAG
AGCAATCGTA TGGGAAAGCT AGTAAAACAG CTGTTAGATT TTTCTAGTGC GGAGAATAGA
GAAGTGCCTA TGGAAAAGCT GGACTTCTCT CATGTGGTTA CTGGAGAATC ACTGGTCTTT
GAGACGTTTG CGTTTGAGAA TGGCAAGGTG CTTCAAAGCA ACATTGAAGA GGGGATTGTT
CTTACAGGCA ATCAGAATCA GCTTACGCAG GTTATTTCTG TGCTGCTTGA TAACGCCCTG
AGGCACACAA CGGGTACTCA GATTGAGTTA AATCTTAAGA AACAAGGTCA TAGCGCCATC
TTAAGTGTTA GTAATGACGC GGAAGAGATT TCTCAAGAAA AGCTTGAGCA TCTGTTTGAT
CGTTTTTATC GCGTTGATGA TGTACGAAAT AGTGAGGATA ATCACTATGG ACTAGGTCTT
TCGATTGCAC AAGCTGTGGT TCAAAAGCAT GGTGGAACTA TTAATGTAGG CTATTCAGAG
GGTCAGATTA CTTTTACTGT TCAGCTTCCT ATTAAGGGCA AATAA
 
Protein sequence
MVSLVLLFTV TLSVIMFAST QEIRQKNMDV LNRYASQYSL EKEKGSSEGQ SGPEGQGSSE 
GQGSSEGQTS SEDQSPQQPL VKPDAQLSNK SDNQPPGQRS AYELSTFYSV SFSVDGSVLS
VFNGEKTVSS DENLTEFARQ ILNEGNPSGR TGNLSYVVMK KDGYTLVAFM DNTVSEAGLQ
TMMRNALLVG GVSLVGMFFI SVFLAKRIIR PLEESDKKQK QFLSDASHEL KTPIAVIDAN
AEILSRELSH NEWLSNIQYE SNRMGKLVKQ LLDFSSAENR EVPMEKLDFS HVVTGESLVF
ETFAFENGKV LQSNIEEGIV LTGNQNQLTQ VISVLLDNAL RHTTGTQIEL NLKKQGHSAI
LSVSNDAEEI SQEKLEHLFD RFYRVDDVRN SEDNHYGLGL SIAQAVVQKH GGTINVGYSE
GQITFTVQLP IKGK
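
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block could be cleaned, translated, and checked for GC content. The helper names are hypothetical. The codon table is the standard 64-letter amino acid string in TCAG order, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(dna.count(base) for base in "GC") / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence as printed above.
    first_line = clean(
        "ATGGTGTCGC TTGTTCTGCT GTTCACGGTG ACGCTGTCGG TCATCATGTT TGCTAGCACT"
    )
    print(translate(first_line))  # MVSLVLLFTVTLSVIMFAST
```

The printed residues match the first twenty residues of the protein sequence above. Passing the full gene sequence through clean() and gc_percent() would give a value to compare against the listed GC content.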