Gene Apar_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1120 
Symbol 
ID8413993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1268634 
End bp1269827 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content46% 
IMG OID645022709 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_003180139 
Protein GI257784922 
COG category[C] Energy production and conversion 
COG ID[COG1979] Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAATT TTGAGTATGT TGTACCTACT CAGATAGTTT TTGGAAAAGA TACTCAGCTT 
CGTGTGGCAG AGCTTGTTTC TGCTCATGGA GGAACCAAGG TGCTGGTGCA TTTTGGCGGC
AACCATGTGG TAAAGAACGG TTTACTTGAT CAGATTCATC AGCTTTTGAG TAACGCCGGA
ATTGCTTTTG TTGACTGTGG TGGAGTTGTC CCTAATCCTC GTTTAGAGCT TGCAGAAGAA
GCTATAGAAC TGGGCAAACG CGAGGGCGTT GATTTCATTT TGGCTATCGG CGGCGGATCT
GTAATGGATT CTGCTAAAGC AATTGCGTAT GGCTTAGCTA ATGACGTTTC TCTTGAAGAC
CTTTATTTAC ATAGGGTTGG TGTTTCAAAA GCAACTCCTA TAGGAGTCAT TTCTACTATT
GCGGGAACTG GCTCTGAGAC CTCGGATTCT TCCGTCATGA ACATTACGCT GAGTGATGGA
CGTATCCTTA AACGCTCTTA TCACCATCAA AGCGGTCGTC CCTGCTTTGC TATCATGAAC
CCAGAGCTTA CCTATTCCGT GAGTGCATAT CAGACAGCAT CAACGGGCGC TGACATTATG
ATGCATACTA TGGAGCGCTA CTTTACGCTT GAGAAAGACG TTGCACTTAC CGATGAGCTG
GCTGAAGGTC TTTTGCGTAC AGTAAAAGAA GCAGTGCTGA TAGCCATTAA AGAGCCTAAG
AATTACGCAG CGCGCGCAGA CTTGCTTTGG GCATCGTCGC TTTCTCACTG CGGGCTTACT
GGAACAGGTC GCGTAAGCGA TTTTGCTTCT CATGCCATTG AACATGAGCT CAGTGCAAAG
TACGATGTAG CTCATGGTGC TGGTCTTACC GCTATTTGGG CAAGCTGGGC TCAGTATGTT
ATGAATCAAG ATCCAAAGCG TTTCGCGCAG TTTGCGGTAA ATGTTTTTGG AGTTCAAAAC
AACTTTCATA ATTCAACTGC CACCGGCCTT GCAGGAATTG AGGCTTGGAA TCAATGGTGT
CATGCAATTG GTATGCCAAC CTCCCTCGCT GAACTTGGAG TAAGCCCTTC AGAAGAAGAT
ATTCTCCAGA TGGCTCAGGG AGCTATTGAC GCTCGTGGCG GAGATCATGC GGGTAACTTT
ATGCAGCTTA AAACAACAGA TGTTCAACAG ATTTTAAGGA TGGCTTTAAA GTAA
 
Protein sequence
MVNFEYVVPT QIVFGKDTQL RVAELVSAHG GTKVLVHFGG NHVVKNGLLD QIHQLLSNAG 
IAFVDCGGVV PNPRLELAEE AIELGKREGV DFILAIGGGS VMDSAKAIAY GLANDVSLED
LYLHRVGVSK ATPIGVISTI AGTGSETSDS SVMNITLSDG RILKRSYHHQ SGRPCFAIMN
PELTYSVSAY QTASTGADIM MHTMERYFTL EKDVALTDEL AEGLLRTVKE AVLIAIKEPK
NYAARADLLW ASSLSHCGLT GTGRVSDFAS HAIEHELSAK YDVAHGAGLT AIWASWAQYV
MNQDPKRFAQ FAVNVFGVQN NFHNSTATGL AGIEAWNQWC HAIGMPTSLA ELGVSPSEED
ILQMAQGAID ARGGDHAGNF MQLKTTDVQQ ILRMALK