Gene Apar_0163 details


Gene Information

Locus tag: Apar_0163
Symbol:
ID: 8413009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 188478
End bp: 189857
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 46%
IMG OID: 645021733
Product: hypothetical protein
Protein accession: YP_003179190
Protein GI: 257783973
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
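
The coordinate and length fields in this record are linked by simple arithmetic. The Python sketch below illustrates that relationship. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(188478, 189857)
print(length, protein_length_aa(length))
```

With the values listed for Apar_0163, this gives 1380 bp and 459 aa, in agreement with the Gene Length and Protein Length fields.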


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCATA AAGAAATTAC CTGGAATAAC CTACGAATTC CTGTTGTTGG AAGTTTTGAT 
GTAGTTATTA TTGGCGGCGG TGTTTCTGGT TCTGCCTGTG GAATTCGCTG TGTTAATGAA
GGTCTTTCAA CGCTCATTGT GGATAAGTCA AACTTGCTAG GAGGATCTGC TACACGAGCG
CTTGTATGCC CGATGATGCC TACGTATGTC CGTCACCTGC CTGTTCTTTC TTCCATTGAG
CAAGAGCTTT TGGCTTCAGG TGACGCTACG CGTGATAATT ACACTACCAT GATTTGGTTT
GCTCCTGAGC GCTTGGGAGA AGTATATGAG CAGCTTTACG CTGCTAAGGA CGGTCAGATT
CTCTACGATA CGTCTCTTGT AGCTGTAGTT TTATCTGATT CTGATGGCGA GAAAACCATT
ACGCATGCAA TTCTCGCCTC AACCGAGGGC CTGATTGCAG TAGCTGCTCA AACGTGGATT
GATGCATCTG GCGATGCGGT TTTGTCCAGA GCTGCTGGCG TACCAGTTGC AGCTGGTGAT
AAGGATGGCA TTAATCAGGT TTGCAGCCTG CGTTTTACCA TGGGTGGTAT TGATGTTGAG
CGTTATAGGG ATTACGTGCT TTCGCTTGAT GATCACTTCT CACCATTAGT TGATGGATAT
TTCTTTGAGT CTGCAATGGT TGCGGGTAAA AATTTTAAGC TGGAGCCAAA GTTCCGCGAG
GGAATTGAAG CAGGCATTCT TGAAGAAGAG GACTTGCGCT ACTATCAGAT TTTCTCGCTT
CCCGGAAAAC CAGGTTGCAT GGCATTAAAC TGCCCACATA TTGCAAGTAT GCGTACAAAC
ACTACGGGAG CTGCACGTTC CAATGCAACG ATTGAAGCTC ACGCCCGCAT TCGCCGGCTT
GTTACTTTCC TGCAGCGTAT GATGCCTGGA TTTGAACAAA GCTATCTTAT GGAGCAGGCT
TCACTTTTGG GTGTTCGTGA GAGCTGGCGC GTTGAAGGCC AGGTAATGTT GACTGAAGAG
GATTACGTTA ACCAGGCCCG ATTTGATGAC GGTCTTGTTC GTGGCGATTG GTACATTGAT
GTTCACTCCA ATAAGGGTGG ACTCTTCCAT AAAAATACGT ACAAACAAGG TGACTATTAC
GAGATTCCGT TCCGTTCGAT GGTCACCAAA CATATCAGCA ACCTGGGTGT TATTGGTCGT
TGTATCTCTA CGACTTTTTT GATGCAGGCA AGTGTTCGCA TTATTCCAAC GGTGACCGAT
ATGGGAGATG CTATGGGTAC AGCGTGTGTT CTCGCAAAAC GTACGTCAAC ACCATTGGCC
AAATTGGATG GCGCGGCGGT TCGTGCAGAA GTAGAACAAA TGAAAGCGTT GAATTTATGA
 
Protein sequence
MIHKEITWNN LRIPVVGSFD VVIIGGGVSG SACGIRCVNE GLSTLIVDKS NLLGGSATRA 
LVCPMMPTYV RHLPVLSSIE QELLASGDAT RDNYTTMIWF APERLGEVYE QLYAAKDGQI
LYDTSLVAVV LSDSDGEKTI THAILASTEG LIAVAAQTWI DASGDAVLSR AAGVPVAAGD
KDGINQVCSL RFTMGGIDVE RYRDYVLSLD DHFSPLVDGY FFESAMVAGK NFKLEPKFRE
GIEAGILEEE DLRYYQIFSL PGKPGCMALN CPHIASMRTN TTGAARSNAT IEAHARIRRL
VTFLQRMMPG FEQSYLMEQA SLLGVRESWR VEGQVMLTEE DYVNQARFDD GLVRGDWYID
VHSNKGGLFH KNTYKQGDYY EIPFRSMVTK HISNLGVIGR CISTTFLMQA SVRIIPTVTD
MGDAMGTACV LAKRTSTPLA KLDGAAVRAE VEQMKALNL
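
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC content. The Python sketch below is one way to do this and is not the database's own method. It assumes the sequence is pasted in the 10-base block layout shown above (whitespace is stripped). It uses the codon assignments of translation table 11, which match the standard genetic code. Alternative start codons are not handled specially, which does not matter here because the gene starts with ATG. The names `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGATTCATA AAGAAATTAC CTGGAATAAC CTACGAATTC CTGTTGTTGG AAGTTTTGAT"
print(translate(first_line))
```

Translating the first 60 bases yields MIHKEITWNNLRIPVVGSFD, which matches the first 20 residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` lets the GC content field be checked in the same way.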