Gene B21_02527 details


Gene Information

Locus tag: B21_02527
Symbol: hypF
ID: 8113958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2674348
End bp: 2676600
Gene Length: 2253 bp
Protein Length: 750 aa
Translation table: 11
GC content: 57%
IMG OID: 644848727
Product: hypothetical protein
Protein accession: YP_003000300
Protein GI: 251785996
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0068] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00143] [NiFe] hydrogenase maturation protein HypF
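
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; both are assumptions inferred from the numbers in this record, not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2674348, 2676600)
print(length, protein_length_aa(length))  # prints "2253 750"
```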


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA ACACATCTTG CGGTGTCCAA CTGCGTATTC GTGGCAAAGT GCAGGGCGTC 
GGTTTTCGTC CGTTTGTCTG GCAGTTGGCA CAGCAATTAA ATCTTCACGG CGATGTCTGT
AATGACGGCG ATGGCGTGGA AGTCCGGTTG CTGGAAGACC CGGAAACGTT TCTTGTTCAA
TTGCATCAGC ACTGCCCGCC GCTGGCGCGT ATTGATAGCG TCGAGCGTGA GCCGTTTATC
TGGTCACAAC TGCCCACTGA GTTCACTATC CGCCAGAGCA CAGGCGGCAC CATGAATACG
CAAATTGTCC CGGATGCCGC CACTTGCCCT GCTTGCCTTG CCGAAATGAA TACCCCAGGC
GAACGGCGTT ATCGTTATCC GTTTATCAAC TGTACTCACT GTGGTCCGCG TTTCACCATT
ATTCGCGCCA TGCCTTACGA CCGCCCGTTT ACCGTAATGG CGGCGTTTCC GCTGTGTCCG
GCCTGTGATA AAGAGTACCG TGACCCGCTC GATCGTCGCT TCCACGCCCA GCCGGTGGCC
TGCCCGGAGT GTGGCCCGCA TCTTGAATGG GTAAGTCATG GTGAACATGC AGAACAAGAG
GCGGCATTAC AGGCAGCTAT CGCACAGTTA AAAATGGGCA ACATTGTCGC CATCAAAGGG
ATTGGCGGAT TTCATCTTGC CTGCGATGCA CGTAACAGTA ACGCGGTGGC GACACTGCGG
GCACGCAAAC ATCGCCCGGC GAAACCGCTG GCGGTTATGT TGCCAGTGGC AGAAGGTTTA
CCAGACGCTG CGCGCCAGTT GCTTACCACG CCCGCCGCGC CGATTGTGCT GGTGGATAAA
AAATACGTTC CTGAGCTTTG TGATGATATC GCCCCTGACC TTAACGAAGT CGGGGTAATG
TTGCCTGCGA ACCCGCTCCA GCATTTGCTG TTACAGGAAC TGCAATGCCC GCTGGTGATG
ACCTCCGGCA ACCTGAGCGG TAAACCACCA GCTATCAGCA ACGAACAGGC GCTGGCGGAT
TTGCAGGGCA TTGCCGACGG ATTCTTGATA CATAACCGCG ACATCGTGCA GCGGATGGAT
GATTCGGTGG TGCGCGAAAG CGGCGAAATG CTGCGCCGTT CGCGGGGGTA TGTGCCGGAT
GCGCTGGCTT TGCCTCCGGG CTTTAAAAAT GTTCCGCCTG TGCTGTGTCT CGGCGCGGAT
CTGAAAAATA CCTTCTGCCT GGTGCGCGGT GAACAAGCGG TGTTGAGTCA GCATCTGGGC
GATTTAAGTG ACGATGGCAT CCAGATGCAG TGGCGCGAAG CGTTACGCCT GATGCAAAAC
ATCTACGATT TTACCCCGCA ATACGTTGTG CATGACGCGC ATCCGGGCTA TGTCTCCAGC
CAGTGGGCGC GTGAAATGAA TCTGCCGACG CAAACGGTAC TGCATCATCA TGCCCATGCA
GCGGCGTGTC TGGCAGAGCA TCAGTGGCCG CTGGATGGCG GTGATGTCAT TGCTTTGACG
CTCGACGGTA TCGGTATGGG GGAGAATGGC GCTTTGTGGG GCGGCGAGTG CCTGCGGGTG
AACTATCGCG AATGTGAGCA CCTGGGCGGC TTGCCTGCAG TGGCGCTTCC GGGTGGCGAT
TTGGCAGCGA AGCAGCCGTG GCGAAACCTG CTGGCGCAGT GCCTGCGCTT TGTGCCGGAG
TGGCAGAATT ATCCCGAAAC AGCAAGTGTG CAACAGCAAA ACTGGAGCGT GCTGGCGCGG
GCCATTGAGC GTGGAATTAA CGCGCCGCTG GCGTCATCGT GTGGGCGTTT TTTCGATGCA
GTGGCGGCGG CACTGGGCTG TGCGCCAGCC ACGTTAAGTT ATGAAGGTGA AGCGGCTTGT
GCTCTGGAGG CGCTCGCAGC CTCATGCCAC GGAGTGACGC ATCCGGTGAC AATGCCGCGG
GTGGACAATC AACTGGATCT CGCCACTTTC TGGCAGCAGT GGCTGAACTG GCAGGCACCG
GTTAATCAAC GCGCGTGGGC GTTTCATGAT GCGCTGGCGC AGGGTTTTGC CGCGTTGATG
CGTGAGCAGG CCACGATGCG TGGTATCACT ACGCTGGTAT TTAGCGGCGG GGTTATTCAT
AACCGTTTGC TGCGTGCACG TCTGGCGCAT TATCTCGCTG ATTTCACATT GCTCTTTCCA
CAGAGTTTAC CGGCGGGTGA TGGCGGTTTG TCTCTGGGGC AGGGGGTTAT TGCTGCGGCG
CGTTGGTTAG CGGGTGAAGT CCAGAACGGA TAA
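
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs light cleaning before any computation. The following is a minimal sketch of how the block layout could be stripped and a GC fraction computed, which is one way the "GC content" field might be reproduced. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCAAAAA ACACATCTTG CGGTGTCCAA CTGCGTATTC GTGGCAAAGT GCAGGGCGTC"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60 (six blocks of 10 bases)
print(round(100 * gc_fraction(seq)))  # GC percentage for this line only
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since `str.split()` with no arguments splits on any whitespace.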
 
Protein sequence
MAKNTSCGVQ LRIRGKVQGV GFRPFVWQLA QQLNLHGDVC NDGDGVEVRL LEDPETFLVQ 
LHQHCPPLAR IDSVEREPFI WSQLPTEFTI RQSTGGTMNT QIVPDAATCP ACLAEMNTPG
ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ACDKEYRDPL DRRFHAQPVA
CPECGPHLEW VSHGEHAEQE AALQAAIAQL KMGNIVAIKG IGGFHLACDA RNSNAVATLR
ARKHRPAKPL AVMLPVAEGL PDAARQLLTT PAAPIVLVDK KYVPELCDDI APDLNEVGVM
LPANPLQHLL LQELQCPLVM TSGNLSGKPP AISNEQALAD LQGIADGFLI HNRDIVQRMD
DSVVRESGEM LRRSRGYVPD ALALPPGFKN VPPVLCLGAD LKNTFCLVRG EQAVLSQHLG
DLSDDGIQMQ WREALRLMQN IYDFTPQYVV HDAHPGYVSS QWAREMNLPT QTVLHHHAHA
AACLAEHQWP LDGGDVIALT LDGIGMGENG ALWGGECLRV NYRECEHLGG LPAVALPGGD
LAAKQPWRNL LAQCLRFVPE WQNYPETASV QQQNWSVLAR AIERGINAPL ASSCGRFFDA
VAAALGCAPA TLSYEGEAAC ALEALAASCH GVTHPVTMPR VDNQLDLATF WQQWLNWQAP
VNQRAWAFHD ALAQGFAALM REQATMRGIT TLVFSGGVIH NRLLRARLAH YLADFTLLFP
QSLPAGDGGL SLGQGVIAAA RWLAGEVQNG
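
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last stretches of the gene. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are allowed), and the `translate` helper is hypothetical rather than part of any tool behind this record.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGGCAAAAA ACACATCTTG CGGTGTCCAA"))  # prints MAKNTSCGVQ
print(translate("CGTTGGTTAG CGGGTGAAGT CCAGAACGGA TAA"))  # prints RWLAGEVQNG
```

Both outputs match the first and last 10-residue blocks of the protein sequence listed above.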