Gene Apar_0371 details


Gene Information

Locus tag: Apar_0371
Symbol:
ID: 8413220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 425653
End bp: 426993
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 49%
IMG OID: 645021939
Product: putative phosphohistidine phosphatase, SixA
Protein accession: YP_003179393
Protein GI: 257784176
COG category: [T] Signal transduction mechanisms
COG ID: [COG2062] Phosphohistidine phosphatase SixA
TIGRFAM ID: [TIGR00249] phosphohistidine phosphatase SixA
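
The length fields in this record are consistent with its coordinates. The short Python sketch below shows the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(425653, 426993)
print(length_bp)                           # 1341
print(expected_protein_length(length_bp))  # 446
```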


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGGTAAAGA CTTTGGTTCT TGTTAGACAT GGTGTATCTG AACGTGGCTC TGAAGATATG 
AGTCGAGAGT TAACACGTGC TGGTCAGAGG GCTCTTTCTG CCAATTACCC TCATATCTTT
GGTCTGTTGG GAGCAGAGGG TGAAGAGGCT GAGATTTGGA CAAGTCCCGC TCTTCGTGCA
CTGGAGACCG CAGAGATTGT GGCAGAAGCC TTGGATGCAG AGGGCCTAGA GATTCATGAC
TCCCTTTACG ATCAAGATCT TCCTGCACTT CAGGCAGAGC TTGAGCACGC TGATGCAGAG
ACGCTGGTTC TTGTTGGTCA TGCTCCATTT TTGGGCTATG TTGCAGAGAC TCTTCTGGGA
TTTGAGCTTC CGCTGACAAA GGGTGCTGTG TGCGCAATTG ACGTACGTGG TTCACTTGGT
CATCAACACG AGTGCGTCTG GAAGCAACTT GGAGATGTTC GTGAGCCTCA TGGTAAACTT
CTTTGGCTGG TAAGCGGTCC TTCTACACAG CCGTGGGAGA CACTTGATGC CCTGGATGAG
GCGTGCGCTC ATGCAGCAAC CAATCTGGAA GATGCGTACA CAGAGTTTCG TGTCCATCCA
GAAGATCCTG CGGTAATCAA AGCATTTAGA TTTGCTCTTA GGGGAACCCA GCTGCTTACC
AAGTTCTTCT CGCCACTTCT TAACGAGGAG GCGGTTGAGA TTGCTGAGCC TGTTTATAGG
CTGATGCTTG GAGCAACTAC GCGCCTTCGT GAAATTGATG GCTTCTCTGA TACCGTTGCA
GATTTGATGG AGTCTGGAGA GCTTTCACAG GGCTCTAAGC TAGTCAGCGC TGTAGGGGCA
GCTCGCGAGG CTGAGCGTGA TCGTGTCTGT GAAGGTTTGC GCAAGAAGGC GGTTCGCCGT
AGTCTGCGCT GCGCTTTGGA TGAGCTCTTT GAGCCCGCAT GGTCAGATAC GGTTTTGCAG
GACGGTCTCT CATTTGAAGA CATTACCAGC CGCTTTGACT ATATGCTTGA GACTATTGAT
GCACGTCTGT TTGGTCTGGA TATGACCAGT TTTTCCGAGG TTCATCACGC AAGGCGAGAA
GTCCGTGAGG TTGAGCACAT TCTGCTCCAC CTCAGTGACA CGCTAGGTGA AAAGCGCGCT
AATTACGCTC AGATTATGCA GGATATTGAC TCTGAGCTCA GTACTGTTTG TACAGCTCAG
CGAAACATTT CTCTGGTCAA GGAGTGGAAA GACTCCATGG ACTTCAGAGA TGTTACCTCA
GATCTCGCAA TTGTTTCAGA ACATGAAAAA GTTTTAATTA AGCGTGTTAT TGAAGGTAGA
GAGACGAGTA TCCTTAGATA A
 
Protein sequence
MVKTLVLVRH GVSERGSEDM SRELTRAGQR ALSANYPHIF GLLGAEGEEA EIWTSPALRA 
LETAEIVAEA LDAEGLEIHD SLYDQDLPAL QAELEHADAE TLVLVGHAPF LGYVAETLLG
FELPLTKGAV CAIDVRGSLG HQHECVWKQL GDVREPHGKL LWLVSGPSTQ PWETLDALDE
ACAHAATNLE DAYTEFRVHP EDPAVIKAFR FALRGTQLLT KFFSPLLNEE AVEIAEPVYR
LMLGATTRLR EIDGFSDTVA DLMESGELSQ GSKLVSAVGA AREAERDRVC EGLRKKAVRR
SLRCALDELF EPAWSDTVLQ DGLSFEDITS RFDYMLETID ARLFGLDMTS FSEVHHARRE
VREVEHILLH LSDTLGEKRA NYAQIMQDID SELSTVCTAQ RNISLVKEWK DSMDFRDVTS
DLAIVSEHEK VLIKRVIEGR ETSILR
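
As a rough illustration of how the two sequence blocks relate, the Python sketch below joins the 10-character groups and translates the first and last lines of the gene sequence codon by codon. It assumes the sequence is given in coding orientation and uses the amino acid assignments of NCBI translation table 11, which match the standard code for sense and stop codons. The helper names are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G); table 11
# shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGGTAAAGA CTTTGGTTCT TGTTAGACAT GGTGTATCTG AACGTGGCTC TGAAGATATG"
last_line = "GAGACGAGTA TCCTTAGATA A"

print(translate(clean_sequence(first_line)))  # MVKTLVLVRHGVSERGSEDM
print(translate(clean_sequence(last_line)))   # ETSILR*
```

The two outputs correspond to the first 20 and last 6 residues of the protein sequence shown above, followed by the stop codon.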