Gene Apre_1371 details

Gene Information

Locus tag: Apre_1371
Symbol:
ID: 8398181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1475959
End bp: 1477485
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 40%
IMG OID: 644995736
Product: hydrogenase large subunit domain protein
Protein accession: YP_003153115
Protein GI: 257066859
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID:
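
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that is not counted in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(1475959, 1477485)   # 1527
length_aa = protein_length(length_bp)       # 508
print(length_bp, length_aa)
```

Under those assumptions, 1477485 - 1475959 + 1 gives the 1527 bp gene length, and 1527 / 3 - 1 gives the 508 aa protein length.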


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000870686
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAT CATACATAGA CATACTTAAT ATTAGAAGGA TGGCCTTTGA GGCAGTTGCC 
AAAATTGCCT ACGAAAACAG GCCAATTACA GACATAGCCC TAGAAGTTTT TGATATTCTT
CCAGGAGAAG AAGCGAGATA TAGGGAAAAT ATCTTTAGAG AAAGAGCAGT AATGGGTGAG
AGACTTAGAA TGTGTATAGG TCTTGACGCA AGAAGTGCTG CTGATACAGA CGCTGTCACA
GAAGGCTTAG ATGCGATGGA TTATGACAGA CGAATCTACA ATCCACCTTT AGTTTCTGTT
ATAAAAATCG CCTGTGAAGC ATGTCCTGAA AACAAAGTCA CAGTAACAGA CCAATGCCAT
GCTTGTATTG GACATCCTTG TGTAAATGTC TGCCCTAAAA ACGCTGTTAC ATATACTGCC
AAGGGAGCAA TAATAGATCA AGATAAATGT ATCAAGTGCG GTAAGTGTGT TGCTGCTTGT
CCATATCAAG CAATCAACCA CCAAAAAAGA CCTTGTGCAG AATCTTGTGG AGTTAAGGCA
ATCGGCTCTG ATGAATTAGG TCGTGCTAAA ATCGACGAAG ATAAGTGCGT AGCTTGTGGT
AGATGTATTA TAACTTGCCC ATTTGGAGCA ATCTCAGATA AGAGTGAGAT CTACCAACTT
ATCAAGTCTC TACAATCAGA TAGGAAAGTT TATGCTATAA TTGCACCATC ATTTGTCGGA
CAATTTGGTG TAAATGTAAG CCCTGAACAA ATCAGAGAGG CTATCAAACA ACTTGGCTTT
GACGATGTTA TAGAAGTAGG CCTTGGAGCT GACCTAACTA CAATGAACGA AGCCCATGAG
TATTTAGAAC AAGTTCCAAC AGGCAAGATT CCATTTATGG GAACAAGCTG CTGTTTCTCA
TGGAAGTTAA TGGTAAGAAA CCAATTCCCA GATATAAATG ATAAGATTTC TGAATCATCA
ACACCAATGA TCTATTCAGG AAAACAAATG AAAAAACGTG ATCCAAACTG TGAAGTAGTA
TTTATAGGAC CATGTATATC CAAGAAACTT GAAGCTCTTG AAGAAGAAGT AGCTGAAGTA
ATAGACTTTG TAATAACCTA CGAAGAGCTT CTAGGAATGT TCCTTGCTAA AGGAATCGAA
CCATCAGAAA TCGAAGTAGA TGAGCCAATG ATGGATGCCT CAGAGACAGG AAGATTCTAT
GCAGTAAGTG GTGGTGTTGC CGAAGCTGTT AAGAGAAGAG TGGGAGAAAT CGATCCGGAT
GCCAAGGTTG AAGTAGAAAA TGCTGAAGGA TTAGACAACT GTGTCAAGCT TGCAAGAATG
GCCAAACTCG GCAGGATGGA CGGCAAGCTT ATCGAAGGCA TGGCTTGTAT GGGAGGCTGT
GTCGGAGGAC CTGGAACAGT AGTTGCAGAA AATAAGACCG GCAAGAAAGT TAAGGCCTTC
GCAGCTGAAT CAATCTACAG ATCTCCTGCT GATAACGAGA ATATTCCAGT AGAAGACAGA
CCAGATGATA GCAAATTACA AAAATAA
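
The record reports a GC content of 40% for this gene. A figure of that kind could be recomputed from the sequence block above with a sketch like the one below. The function name is hypothetical, and the sketch assumes the sequence is plain text made of A, C, G and T, with the spaces and line breaks used for grouping ignored.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in the record)
    is removed before counting, and case is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustrative inputs; paste the full gene sequence to check the record.
print(gc_content("ATGC"))       # 50.0
print(gc_content("aatt"))       # 0.0
```

Passing the full gene sequence as a single string (grouping spaces included) would give the value to compare against the rounded percentage in the record.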
 
Protein sequence
MKESYIDILN IRRMAFEAVA KIAYENRPIT DIALEVFDIL PGEEARYREN IFRERAVMGE 
RLRMCIGLDA RSAADTDAVT EGLDAMDYDR RIYNPPLVSV IKIACEACPE NKVTVTDQCH
ACIGHPCVNV CPKNAVTYTA KGAIIDQDKC IKCGKCVAAC PYQAINHQKR PCAESCGVKA
IGSDELGRAK IDEDKCVACG RCIITCPFGA ISDKSEIYQL IKSLQSDRKV YAIIAPSFVG
QFGVNVSPEQ IREAIKQLGF DDVIEVGLGA DLTTMNEAHE YLEQVPTGKI PFMGTSCCFS
WKLMVRNQFP DINDKISESS TPMIYSGKQM KKRDPNCEVV FIGPCISKKL EALEEEVAEV
IDFVITYEEL LGMFLAKGIE PSEIEVDEPM MDASETGRFY AVSGGVAEAV KRRVGEIDPD
AKVEVENAEG LDNCVKLARM AKLGRMDGKL IEGMACMGGC VGGPGTVVAE NKTGKKVKAF
AAESIYRSPA DNENIPVEDR PDDSKLQK
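
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce the record: the names are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace used for grouping is ignored. Alternative start codons
    (for example GTG or TTG) are NOT converted to M in this sketch.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in the record.
print(translate("ATGAAAGAAT CATACATAGA CATACTTAAT"))  # MKESYIDILN
print(translate("CCAGATGATA GCAAATTACA AAAATAA"))     # PDDSKLQK
```

The two fragments reproduce the first ten and last eight residues of the listed protein, and the final TAA codon is the stop that is excluded from the 508 aa count.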