Gene Paes_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1643 
Symbol 
ID6458412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1787616 
End bp1789952 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content54% 
IMG OID642725631 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_002016308 
Protein GI194334448 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGCG TGGTTGAAAG CAGCAATGAA CCGATCCTGA GCGGGAGAGC AAGGCGCCTG 
CGTCTCGCGG TTAACGGGAT TGTTCAGGGT GTCGGGTTCC GGCCTTATGT CTACCGCCTT
GCGACATCTC TGGGGCTGAG CGGTTCGATT CGCAATACCG CTTCCGGCGT TTTGATCGAA
GTGCAGGCTG AGACTCCGTT GCTTCTGGAG CGCTTCGTTC AGGAGCTTGG TTCTGAAGCT
CCTCCTCTCG CAGAAATTTT TTCCGTTGTT GCCGATGAGA TTCCCGCTCA ATCGGGAAAC
GGATTTGTGA TTCTCTCTTC AGATGACGGT GAAGATGTCG AAACCCTGAT TCCTCCTGAT
ATCGCGCTGT GCGATGATTG TCGTTCAGAG CTTCTTGATC CTCTTGATCG ACGCTACCGC
TATGCGTTTA TCAATTGTAC GAATTGCGGG CCGCGCTTCA GTATTGTTGA GCGTCTTCCA
TACGACCGTC CTTTGACATC GATGAAGGGA TTCACGATGT GCGAGAAGTG CGGTCGGGAG
TACGGTAATC CTCTCGATCG TCGTTTTCAC GCGCAGCCGA ACGCCTGTCC GGATTGCGGG
CCGGCCCTTC AGTTACTTGA TGCAGGCGGA GAGCCGATCA ATGCGCCTGA TCCTCTGTCG
CTCGCACTCC AGAAGCTGAA AGATGGCGAT ATTGTCGCAG TCAAGGGGGT GGGGGGATTT
CATCTCGCTG TTGACGCAAC AGATCAGTCT GCCGTGATGA ATCTGCGAGA GAGAAAAGGG
AGGGAGCGAA AGCCTTTCGC CGTGATGGCA CGATCATTGG GCTCTGCTGA AAAAGTGGCT
CAGGTATCGG ATGAAGAACG TGTTGCGCTG CGCTCTTTGC AGGCTCCGGT TGTGCTGATG
CATAAGAAAG CATCCGCCTC GTTGCTGGCT CCTGACGTGG CTCCTGCTAA CGATCGGATC
GGCCTCATGC TTCCCTATTC TCCTCTTCAT GTTCTTATGA TGGAGGAGGG CCCGGAATTT
CTGGTGATGA CGAGTGCCAA CTCCAGCGAG GAGCCTATTG CTCTTGAAAA CGATGAGGCG
GTCAGTCGAT TGCAGGGGAT TGCAGATTAT TTTCTTGTCC ATAACCGGCC AATACATCTT
CGATGCGATG ATTCCGTCAC GATGGTGATG TCAGGGGCGT TGCGACAGAT CAGACGGGGT
CGCGGTTATG CCCCGCTGCC TGTTCTGCTT TCTTCGGATG GGCCTTCAGT GCTTGCGGCC
GGGGGTGAAA TGAAGAATAC GGTTTGTGTG CTTCAAGGAT CTCAGGCTCT TTTGAGCCAG
CATATCGGCG ATCTGAAAAA TTATGTGGCA TATGAGCATT TTCAGCAAGT TGCTGAGCAT
ATGCAGCATA TTTTTCAGGT TCGTCCCGAA GCGATTATCA GTGATATGCA TCCATCATAT
CTTTCGACGC AATGGGCTCA AAACCAGAGT GATATCCCCG TTCTTCATGT TCAGCATCAT
CATGCTCATC TGGTATCCTG TCTTGCGGAA AACCGTTTTG ACGGCCAGGC TGTCGGTATC
ATCCTTGATG GGACAGGATA TGGCACCGAT GGTACAGTCT GGGGCGGAGA GGTGCTTATC
GGAGACGCAT CGGGGTTCTT TCGTTTCGCG TCACTTGAAC CGGTCCGGAT GCCCGGAGGT
GATCGTGCAG CGCTTTTTCC GTGGAAAGCC GCTGCGGGAT ACCTGTTTCA TACTTATGGT
CATATTCCGG AGATAGCGGC TTTTGAGGGG TGCTTTGTCG AGGGTATTGC CGATCTTCTC
GCCAGACAGG TCAATGCACC TCCTGCCAGC AGTTGCGGCC GTCTTTTTGA TGCGGTTTCA
GCCCTGTGCG GGTTGTGCAG AGAAATCAGT TATGAGGGGC AGGCGGCTAT CGAACTGATG
CATGCTGCAG GCATTGCCGA GGGGAAACCC TTTGCCTGGG AGGTCGTTTC TGCTGGTTCA
GATCGGTGGC ATCTGTCTGT TGCACCGATG GTACAGGATA TTGTGACGGC ACTCGGTTCC
GCAATGAGTG TTTCGGACAT CAGTCGACGA TTTCATGTCA CGATTGTTAA AATGTTTTCT
GATATTGCTC TCAGGGCCTG TCGTTTTTCC GGCCTTACTT CGGTTGCCCT CAGCGGGGGA
GTTTTTCAGA ATCCGCTGGT GTTCGAAGGG CTTGTCAGCG ATCTTCAGCT GCACGGGATT
GAGGTGCTGA CCCATAGCCA GGTTCCATCG AACGATGGAG GCCTTTCACT CGGTCAGGCC
GTGATCGGAA GGCATTGGGT GAAGACGGGA TGCGCCTGTC GGCGCAGTGA CGCGTGA
 
Protein sequence
MGGVVESSNE PILSGRARRL RLAVNGIVQG VGFRPYVYRL ATSLGLSGSI RNTASGVLIE 
VQAETPLLLE RFVQELGSEA PPLAEIFSVV ADEIPAQSGN GFVILSSDDG EDVETLIPPD
IALCDDCRSE LLDPLDRRYR YAFINCTNCG PRFSIVERLP YDRPLTSMKG FTMCEKCGRE
YGNPLDRRFH AQPNACPDCG PALQLLDAGG EPINAPDPLS LALQKLKDGD IVAVKGVGGF
HLAVDATDQS AVMNLRERKG RERKPFAVMA RSLGSAEKVA QVSDEERVAL RSLQAPVVLM
HKKASASLLA PDVAPANDRI GLMLPYSPLH VLMMEEGPEF LVMTSANSSE EPIALENDEA
VSRLQGIADY FLVHNRPIHL RCDDSVTMVM SGALRQIRRG RGYAPLPVLL SSDGPSVLAA
GGEMKNTVCV LQGSQALLSQ HIGDLKNYVA YEHFQQVAEH MQHIFQVRPE AIISDMHPSY
LSTQWAQNQS DIPVLHVQHH HAHLVSCLAE NRFDGQAVGI ILDGTGYGTD GTVWGGEVLI
GDASGFFRFA SLEPVRMPGG DRAALFPWKA AAGYLFHTYG HIPEIAAFEG CFVEGIADLL
ARQVNAPPAS SCGRLFDAVS ALCGLCREIS YEGQAAIELM HAAGIAEGKP FAWEVVSAGS
DRWHLSVAPM VQDIVTALGS AMSVSDISRR FHVTIVKMFS DIALRACRFS GLTSVALSGG
VFQNPLVFEG LVSDLQLHGI EVLTHSQVPS NDGGLSLGQA VIGRHWVKTG CACRRSDA