Gene Apar_0125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0125 
Symbol 
ID8412969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp141751 
End bp142833 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content53% 
IMG OID645021693 
Productoxidoreductase domain protein 
Protein accessionYP_003179152 
Protein GI257783935 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAAA CAGCGGGTCA GATGTCACAG AGTGCCACGA GCTCCGCACA TAACACTGCG 
AGCACTATGC AGAGCGCCAT GAGCGACGGT GACGCAATCT TTACCGCACC ACACCTCAAG
TGGGGTGTTA TTGGCTGCGG AGTCATTGCA AACCAGATGG CAGAGGCGCT GGCTTCGGTT
GGTAGAACCA TCGACGGCGT AGCTAACAGG ACGCAGGAAA AGGCCGTTGC GTTTGCTCAA
AAACACCACG TCAAGCGTGT GTATGACAGT ATCGACGATC TTCTCGCCAG CGACGAAATT
GACGCAGTGT ACCTGACCAC ACCTCATAAC ACGCACATCA TCTACCTGCG CAAAGCACTT
CAGGCGGGTA AGCACGTTCT GTGCGAGAAG TCTATTACGC TCAACTCCGC TGAGTTGCTT
GAGGCAGAAG AACTTGCACG TCAAAATGGC GTTCAGTTGA TGGATGCCTG TACCATTTTG
CACATGCCTC TCTACAAAGA GCTGGTTGGT CGCGTGGAGG CGGGCGAGTT TGGCCCAGTC
AATCTGATTC AAGAAAATTT TGGTAGCTAC AAAGAGTTTG ACATGGAGAA CCGATTCTTC
AATCCTAAGC TTGCTGGTGG CGCCCTTTTG GATATTGGCG TGTATTCGCT GACACTGGCT
CGTCTTTTCT TGAAGAGTCA GCCTCATGAC GTGCTCTCCA TGATGAATCC AGCTCCTACT
GGCGTTGATC AGACGGACGG CATTTTGCTG AGAAATGCCG AGGGTCAGAT GGTTGTTCTG
GCGCTGACAC TTCATTCTAA GCAGCCAAAG CGCGCCATGA TTTCCGCTGA TAAGGCCTTC
ATTGAGATTA TGGAGTACCC ACGCGCGGAC GTTGCTACCA TTACCTGGAC TGACGATGGC
AAGCAGGAGA AAGTTCATGT TGGGCGCACG GCCGATGCTC TGGCATACGT GCTGGCTGAC
CTGGAAGCTG CTGTTGCGGG AGATGCTTCT GCCCAGGCGC AACTTGAGGT CTCAAAGGAC
GTTATGGAGC TCATGACTAG CCTGCGCAAT GACTGGAATT TCCTGTACCC CGAGGAGCAA
TAA
 
Protein sequence
MSQTAGQMSQ SATSSAHNTA STMQSAMSDG DAIFTAPHLK WGVIGCGVIA NQMAEALASV 
GRTIDGVANR TQEKAVAFAQ KHHVKRVYDS IDDLLASDEI DAVYLTTPHN THIIYLRKAL
QAGKHVLCEK SITLNSAELL EAEELARQNG VQLMDACTIL HMPLYKELVG RVEAGEFGPV
NLIQENFGSY KEFDMENRFF NPKLAGGALL DIGVYSLTLA RLFLKSQPHD VLSMMNPAPT
GVDQTDGILL RNAEGQMVVL ALTLHSKQPK RAMISADKAF IEIMEYPRAD VATITWTDDG
KQEKVHVGRT ADALAYVLAD LEAAVAGDAS AQAQLEVSKD VMELMTSLRN DWNFLYPEEQ