Gene Apre_1085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1085 
Symbol 
ID8397872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1161903 
End bp1163636 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content37% 
IMG OID644995432 
Producthydrogenase, Fe-only 
Protein accessionYP_003152833 
Protein GI257066577 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAAGT TAAAGATAAA TAATAGAGAA GTAGAAGTTA ATGAAGGATC TACAATCCTA 
GAAGCTGCAA GTATTTTAAA TATCAAAATT CCAACCTTAT GTCATATGGA TCTTCACGAT
ATAAAGTTTG TAAATAAGCT TGCTTCCTGC AGGGTATGTT TAGTAGAAGA TATAGACAAG
GATAAATTAA TTCCTTCATG TGCTACTTTC GTCAAAGAAG GAATGAATAT TAGAACGGAT
AGCAAAAGAG TGATCAGGGC GAGAAGAGCT ATTGTCGAGC TACTCTTATC AGATCATCCA
TCCGATTGCC TGAAATGTGC CAAAAACCTG GACTGTACTC TCCAAGAGCT CGCCTGTGAC
TTAAATATTA GAGAGATAAG ATACAGAGGA GAGATGAGGA GTCTACCAAT TGATGATAAT
TCATATTCAC TAATTAGAGA TCCAAACAAG TGCATTCTTT GTAGAAGATG TGAGACTATG
TGTAACGAGG TTCAAACCGT TGGAGCTCTT GCAGAAGTTG GAAGGGGTTT TTATACTCAT
GTTGGATCAA CCTTCAATAG GTCCATGTTT GAGACGACTT GTACCTTTTG TGGCCAATGT
CTATCAGTAT GTCCTACAGG GGCTCTTACA GAAAAGTCAA ATATACCAGA AGTCTGGAGG
GCCCTTTCAT CTGACAAACA TGTAATAGTA CAAGTTGCAC CAGCAGTTAG GGTTGCCCTG
GGTGAGATGT TTGGTATTAG AGTAGGAACT AATGTCGAAG GAAAAATCGT AACAGCTTTA
AGGAGACTAG GATTTGATAG GGTATTTGAT ACGAATTTTG CGGCAGACCT TACAATAATG
GAAGAAGCAA ACGAGTTTGT AGATAGACTA AAGGGAAAAG GAGAGCTTCC AATTCTTACA
TCTTGTTGTC CAGGTTGGGT TAATTTCATG GAACAACAGT TTTCTGACAT GATCGATATA
CCTTCCACAT GCAAGTCCCC TCATGAGATG TTTGGAGCAA TTGCTAAGTC CTACTATGCT
GAAAAAGAAG GAATTAATCC AGAAGATATA GTTGTAGTAT CTGTTATGCC TTGTATTTCT
AAGAAATATG AAGCTAAAAG AGATGAGCTA GAAAATGAAG GCTACTCTGA TGTTGATACA
GTAATTACGA CAAGGGAGTT AGCAGAGATG ATTAAGGAAG TTGGAATTGA CTTTGCTTCT
TTGGAAGATA GTGATTTCGA TAACCCTATG GGAGAGTCTA CAGGAGCTGG TGACATATTT
GGTACGAGTG GTGGAGTAAT CGAAGCTACA GTACGTACGG CCTATAATAT AATTACAGAA
AAAGACTTAG AAAAAGTTGA GTTTTATGAT TTAAGAGGTC TTAGAGGAAT AAAATACGCT
ACAGTTGATA TAGAGGGAAG AGAAATTAAG ATTGCGGTTG CCAATGGTTT GGGAAATACG
AGAAGACTTC TAGAAAAATT AAAGAATAAA GAAATATCCC TAGACGCTAT CGAGGTAATG
GCTTGTCCAG GAGGTTGTAT CGGTGGAGGA GGTCAACCTT ATCATCATGG AGATATTTCT
ATCTTAAAAA AAAGATCAGA AGGTCTCTAC AAATTGGATG AATCCAAAAA GCTTAGAAAA
TCTTATGAGA ATCCTTATAT TAAAGACCTC TATGATGAAT ACTTGAAAGA GCCTGGTTCA
GAGAAAGCAC ATAATCTTCT TCATACATCA TACAAGGCAT CTCCAAAATT ATAA
 
Protein sequence
MLKLKINNRE VEVNEGSTIL EAASILNIKI PTLCHMDLHD IKFVNKLASC RVCLVEDIDK 
DKLIPSCATF VKEGMNIRTD SKRVIRARRA IVELLLSDHP SDCLKCAKNL DCTLQELACD
LNIREIRYRG EMRSLPIDDN SYSLIRDPNK CILCRRCETM CNEVQTVGAL AEVGRGFYTH
VGSTFNRSMF ETTCTFCGQC LSVCPTGALT EKSNIPEVWR ALSSDKHVIV QVAPAVRVAL
GEMFGIRVGT NVEGKIVTAL RRLGFDRVFD TNFAADLTIM EEANEFVDRL KGKGELPILT
SCCPGWVNFM EQQFSDMIDI PSTCKSPHEM FGAIAKSYYA EKEGINPEDI VVVSVMPCIS
KKYEAKRDEL ENEGYSDVDT VITTRELAEM IKEVGIDFAS LEDSDFDNPM GESTGAGDIF
GTSGGVIEAT VRTAYNIITE KDLEKVEFYD LRGLRGIKYA TVDIEGREIK IAVANGLGNT
RRLLEKLKNK EISLDAIEVM ACPGGCIGGG GQPYHHGDIS ILKKRSEGLY KLDESKKLRK
SYENPYIKDL YDEYLKEPGS EKAHNLLHTS YKASPKL