Gene Apar_0411 details

Gene Information

Locus tag: Apar_0411
Symbol:
ID: 8413260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 474447
End bp: 476042
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 46%
IMG OID: 645021979
Product: Ferredoxin hydrogenase
Protein accession: YP_003179433
Protein GI: 257784216
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID:
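
The coordinates and lengths listed above can be cross-checked with a short calculation. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 474447
end_bp = 476042

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1596 bp
print(protein_length, "aa")   # 531 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.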


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0687694
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGGCA CAATGCGAGG TGTATATACC AACCTGACTG AGATTCGCAG GAAGGTATTT 
AAAGAGGTTT CGACGCTTGC ATACGAAAAA GCAGACAAGA GCGTCGATGA AATTGCTCGC
CAGATGGAAG AGCTGCCTTA TAAGATTATT CCTGGTGATA TTGCAACTTA CCGTGAGAGC
GTCTTTTTGG AACGCGCTAT TGTTGGTGAG CGTATTCGCA TGACTATGGG TATGCCTATG
CAGGGCGTAG ACCAACCCGA GAATATCTCT GATGGTTTAG AAGAAGCTTC TCGACCAGAG
GTTTATTATC AGCCCCCTCT GATCAACATT GTTAAGTTTG CCTGCAACGC ATGTGAAGAC
AATGTTTATC GCGTTACTAA TGCTTGCCAG GGCTGTCTTG CACACCCTTG TCGAGAAATC
TGTCCTAAAG AGGCAATTAG CTTTGTAGAC AAAAAAGCAT ATATTGACCA GGAGAAGTGT
ATTCAGTGCG GTATGTGCTT TAAGGTTTGT CCTTATCAGG CAATCCATCA CCACGTACGT
CCTTGCGCCG CTGCTTGTGG CATGGACGCC ATTGGCTCCG ATGAGCACGG CCGCGCAGAC
ATTGATTACG AGAAGTGCGT ATCTTGTGGT CAGTGTCTTG TTAACTGTCC CTTTGGTGCA
ATTGCCGATA AAAGCCAAAT TTTCCAAGTA ATTCAGGCTA TCAACGAAGG CTACACGGTT
GTTCCAATTG TGGCTCCTTC GTTTGTTGGT CAGTTTGGCA AGGGTAGTGT TGGTCGTCTT
CGTGAAGCCT TCAAGGAGAT GGGTTTTGCT GAGGTAGAAG AGGTTGCAAC TGGTGCTGAC
CTCTGTACCG TTCAGGAAGC AGAAGACTTT GTCAAAGAAG TTCCAGATAA GCTACCATTT
ATGGGTACAA GTTGCTGCCC TTCTTGGTCT GCAATGGCCA AAAAAGAGTA CCCAGAGCAT
GCTGATGCCA TCTCTATGGC ACTTACTCCA ATGGTTTTGA CTGCTCGTTT GGTAAGAAAA
CAAAATCCAG AGTCTAAGAT TGTCTTTATT GGACCATGTA CTGCTAAGAA ACTTGAGGCA
CTGCGTCGGT CCGTTAAGTC CGAGGTTGAT TTTGTTCTGA CTTTTGAAGA GCTTGCAGGA
ATGATGGAGT CTAAGGGCAT TGACTACACC AAGCTTGATG ACGACAATTC CGACTTTGAA
AATGCTTCTC ATGATGGCCG TGCCTTTGCA GTTGCAGGCG GAGTCGCAGG TGCAGTAGTA
AACGTCATTC ATCAGAAGTA CCCTGATAAA GAAGTGCCTA TCATGGCTGT TGACGGTCTT
GCTGAGTGTC GAGCTATGCT TAAGGATGCC GTTAAAGGTA AATATCCAGG CTATCTTCTT
GAGGGTATGG CCTGTCCAGG AGGATGTGTT GGTGGCGCTG GTACTTTAAG TGCAGTTAAC
CGTGCTGCAG CAGCAGTTAA GCGCTATGCA AAGACCGCTT CTTCCGAATT TGCTTCTGCT
AATAAGTACA ATAATCTTAT TCCAGAACTT GCTGGCACTG TCTCTGCCGA GGTTGAGATT
GCCGAGGTTC TTGAGGTTCA GAAACCTGTT GAGTAA
 
Protein sequence
MPGTMRGVYT NLTEIRRKVF KEVSTLAYEK ADKSVDEIAR QMEELPYKII PGDIATYRES 
VFLERAIVGE RIRMTMGMPM QGVDQPENIS DGLEEASRPE VYYQPPLINI VKFACNACED
NVYRVTNACQ GCLAHPCREI CPKEAISFVD KKAYIDQEKC IQCGMCFKVC PYQAIHHHVR
PCAAACGMDA IGSDEHGRAD IDYEKCVSCG QCLVNCPFGA IADKSQIFQV IQAINEGYTV
VPIVAPSFVG QFGKGSVGRL REAFKEMGFA EVEEVATGAD LCTVQEAEDF VKEVPDKLPF
MGTSCCPSWS AMAKKEYPEH ADAISMALTP MVLTARLVRK QNPESKIVFI GPCTAKKLEA
LRRSVKSEVD FVLTFEELAG MMESKGIDYT KLDDDNSDFE NASHDGRAFA VAGGVAGAVV
NVIHQKYPDK EVPIMAVDGL AECRAMLKDA VKGKYPGYLL EGMACPGGCV GGAGTLSAVN
RAAAAVKRYA KTASSEFASA NKYNNLIPEL AGTVSAEVEI AEVLEVQKPV E
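
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in the permitted start codons), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, third position varying fastest.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_bases = "ATGCCTGGCA CAATGCGAGG TGTATATACC"
print(translate(first_bases))  # MPGTMRGVYT
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function should yield the complete protein, under the same assumptions.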