Gene Apar_0318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0318 
Symbol 
ID8413166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp364294 
End bp365592 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content48% 
IMG OID645021885 
Productputative transcriptional regulator, GntR family 
Protein accessionYP_003179340 
Protein GI257784123 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.024163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGACA GCAGAATCTC GTTTGATCAA TGGGGTGATC TGTATGCAGA TCGCGTTCAA 
ACCATGCGAA AGAGCGAAGT ACGAGATTTG TTTGCGGCTC TGTCTCGTCC AGGCGTTATT
GCGCTGTCTG GTGGCCTGCC GGACATTTCT TCGCTGCCGC TTGATCAGGT TGCAGAGTGC
GCTCGACGCT GCGTTGCTGT TGAGGGACTC AGGTCTCTTC AGTACGGTAA CTCTGATGGC
CGTATTGAGT GCAGAAAGAC CATCTGCAAG ATTTTAGCTG TTCAGGGTAT TGAAGCTGAC
CCTGATGAGA TGATTTTGAC TTCTGGTTCT CAGCAGGCGC TTGATTTCTT GGGTCGTGTT
TTTCTTAATC CGGGTGACGA TATTATCTGT GAGGGTCCAA GTTATCTTGG TGCATTCCAG
GCATTTTCTG CGTATGAGCC TTCCGTTCAT ACTATTGATA TGGACGAAGA GGGCATCAGA
ACTGATTTGC TTGAGGCAAA GCTTAAAGAG CTTGCAAGTC AAGGTAGAAA GCCTAAGTTT
ATCTACGTTA TTCCTAACTT TAATAATCCC GCTGGTATCA CCATGTCTAT GCCAAGGCGC
TTGCGCCTTC TTGAGTTGGC CCACCAGTAC AATATTCCTG TTGTCGAAGA TGATCCGTAT
GGTCTTATTC GTTTTGAGGG AGAAGATTTG ATACGTTTGA AGTCGCTTGA TTCAAACGTT
ATTTACCTAG GGACTACTTC AAAGATTTTT GCCCCTGGTC TGCGTCTAGC ATGGATGGTT
GCACCTCGTC ACTTCCTCGA GCGCATTAAT CTTGCTAAGT CTGGTTCTGA CCTGTGTACC
AGCCCCTTTA ATATGATCCT TGCTGAGCAC TACTTTAATG AGGTTGATTG GCAAGCAGCA
CTTGAGGTTT CCAAGTCTCG CTATAAAGAG AGAAAAGATG CCATGCTGGC AGCTCTTGAG
GAGTTCTTTC CCCAAGATGT TACCTGGACT AAGCCTGAGG GCGGTCTGTT CCTGTGGGTA
ACCTTCCCGC CATACCTTAA CACTGAGCAG CTGCTTCCTC GTGCCATTGA GGAAGGAGTT
GCCTTTGTAC CTGGCGTTTA TTGCTATCCT GACCAGCGTA TTAGCTCAAG TATGCGTCTG
TGTTACTCAT TTGAAACTCC GGAACGCATT CGTGAGGCAA TCCGTAGGTT GTCTTTGTGC
GTCCAAGATC GTATGGCGTT GTATCGTGCA TTCTTGGAGG CAGGAGCTCT GCCCGAGTCT
TCTCGTTCCG AATCTTATTC TTCTGCAAGG GAGTGTTAA
 
Protein sequence
MDDSRISFDQ WGDLYADRVQ TMRKSEVRDL FAALSRPGVI ALSGGLPDIS SLPLDQVAEC 
ARRCVAVEGL RSLQYGNSDG RIECRKTICK ILAVQGIEAD PDEMILTSGS QQALDFLGRV
FLNPGDDIIC EGPSYLGAFQ AFSAYEPSVH TIDMDEEGIR TDLLEAKLKE LASQGRKPKF
IYVIPNFNNP AGITMSMPRR LRLLELAHQY NIPVVEDDPY GLIRFEGEDL IRLKSLDSNV
IYLGTTSKIF APGLRLAWMV APRHFLERIN LAKSGSDLCT SPFNMILAEH YFNEVDWQAA
LEVSKSRYKE RKDAMLAALE EFFPQDVTWT KPEGGLFLWV TFPPYLNTEQ LLPRAIEEGV
AFVPGVYCYP DQRISSSMRL CYSFETPERI REAIRRLSLC VQDRMALYRA FLEAGALPES
SRSESYSSAR EC