Gene EcHS_A2848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2848 
SymbolhypF 
ID5592312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2852591 
End bp2854843 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content58% 
IMG OID640921965 
Productcarbamoyltransferase HypF 
Protein accessionYP_001459476 
Protein GI157162158 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAA ACACATCTTG CGGTGTCCAA CTGCGTATTC GTGGCAAAGT GCAGGGCGTC 
GGTTTTCGTC CGTTTGTCTG GCAGCTGGCA CAGCAATTAA ATCTTCACGG CGATGTCTGT
AATGACGGCG ATGGCGTAGA AGTCCGGCTG CTGGAAGACC CGGAAACGTT TCTTGTTCAA
TTGCATCAGC ACTGTCCGCC ACTGGCGCGT ATTGATAGCG TCGAGCGTGA GCCGTTTATC
TGGTCACAAC TGCCCACCGA GTTCACTATC CGCCAGAGCG CGGGCGGTGC CATGAATACG
CAAATTGTCC CGGATGCTGC CACTTGCCAT GCTTGCCTTG CCGAAATGAA TACCCCAGGC
GAACGGCGTT ATCGTTATCC GTTTATCAAC TGTACCCACT GCGGCCCGCG CTTCACCATT
ATTCGCGCCA TGCCTTACGA CCGCCCGTTT ACCGTGATGG CGGCGTTTCC GCTGTGTCCG
GCCTGTGATA AAGAGTACCG TGACCCGCTC GATCGTCGCT TCCACGCCCA GCCGGTGGCC
TGCCCGGAGT GTGGCCCGCA TCTTGAATGG GTAAGTCATG GTGAACATGC AGAACAAGAG
GCGGCATTAC AGGCAGCTAT CGCACAGTTA AAAATGGGCA ACATTGTCGC CATCAAAGGG
ATTGGCGGAT TTCATCTTGC CTGCGATGCA CGTAACAGTA ACGCGGTGGC GACACTGCGG
GCACGCAAAC ATCGCCCGGC GAAACCGCTG GCGGTTATGT TGCCAGTGGC AGAAGGTTTA
CCAGACGCTG CGCGCCAGTT GCTTACCACG CCCGCCGCGC CGATTGTGCT GGTGGATAAA
AAATACGTTC CTGAGCTTTG TGATGATATC GCCCCTGGCC TTAACGAAGT CGGGGTGATG
TTGCCTGCGA ACCCGCTCCA GCATTTGCTG TTACAGGAAC TGCAATGCCC GCTGGTGATG
ACCTCCGGCA ACCTGAGCGG TAAACCACCG GCTATCAGCA ACGAACAGGC GCTGGCGGAT
TTGCAGGGCA TTGCCGACGG ATTCTTGATA CATAACCGCG ACATCGTGCA GCGGATGGAT
GATTCGGTGG TGCGCGAAAG CGGCGAAATG CTGCGCCGTT CGCGGGGGTA TGTGCCGGAT
GCGCTGGCTT TGCCTCTGGG CTTTAAAAAT GTTCCGCCTG TGCTGTGTCT CGGCGCGGAT
CTGAAAAACA CCTTCTGCCT GGTGCGCGGT GAACAAGCGG TGTTGAGTCA GCATCTGGGC
GATTTAAGTG ACGATGGCAT CCAGATGCAG TGGCGCGAAG CGTTACGCCT GATGCAAAAC
ATCTACGATT TCACTCCGCA ATACGTTGTG CATGACGCGC ATCCGGGCTA TGTCTCCAGC
CAGTGGGCGC GCGAAATGAA TCTGCCGACG CAAACGGTGC TGCATCATCA TGCCCACGCA
GCGGCGTGTC TGGCAGAGCA TCAGTGGCCG CTGGATGGCG GTGATGTCAT TGCTTTGACG
CTCGACGGTA TCGGTATGGG GGAGAACGGC GCTTTGTGGG GCGGCGAGTG CCTGCGGGTG
AACTATCGCG AATGCCAGCA CCTGGGCGGC TTGCCTGCGG TGGCGCTTCC GGGTGGCGAT
TTGGCAGCGA AGCAGCCGTG GCGAAACCTG CTGGCGCAGT GCCTGCGCTT TGTGCCGGAG
TGGCAGAATT ACTCTGAAAC AGCAAGTGTG CAACAGCAAA ACTGGAGCGT GCTGGCGCGG
GCCATTGAGC GTGGAATTAA CGCGCCGCTG GCGTCATCGT GTGGGCGTTT TTTCGATGCA
GTGGCGGCGG CACTGGGCTG TGCGCCAGCC ACGTTAAGTT ATGAAGGTGA AGCGGCTTGT
GCTCTGGAGG CGCTCGCAGC CTCATGCCAC GGAGTGACGC ATCCGGTGAC AATGCCGCGG
GTGGACAATC AACTGGATCT CGCCACTTTC TGGCAGCAGT GGCTGAACTG GCAGGCACCG
GTTAATCAAC GCGCGTGGGC GTTTCATGAT GCGCTGGCGC AGGGTTTTGC CGCGTTGATG
CGTGAGCAGG CCACGATGCG TGGTATCACT ACGCTGGTAT TTAGCGGCGG GGTTATTCAT
AACCGTTTGC TGCGTGCACG TCTGGCGCAT TATCTCGCTG ATTTCACATT GCTCTTTCCA
CAGAGTTTAC CGGCGGGTGA TGGCGGTTTG TCTCTGGGGC AGGGGGTTAT TGCTGCGGCG
CGTTGGTTAG CGGGTGAAGT CCAGAACGGA TAA
 
Protein sequence
MAKNTSCGVQ LRIRGKVQGV GFRPFVWQLA QQLNLHGDVC NDGDGVEVRL LEDPETFLVQ 
LHQHCPPLAR IDSVEREPFI WSQLPTEFTI RQSAGGAMNT QIVPDAATCH ACLAEMNTPG
ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ACDKEYRDPL DRRFHAQPVA
CPECGPHLEW VSHGEHAEQE AALQAAIAQL KMGNIVAIKG IGGFHLACDA RNSNAVATLR
ARKHRPAKPL AVMLPVAEGL PDAARQLLTT PAAPIVLVDK KYVPELCDDI APGLNEVGVM
LPANPLQHLL LQELQCPLVM TSGNLSGKPP AISNEQALAD LQGIADGFLI HNRDIVQRMD
DSVVRESGEM LRRSRGYVPD ALALPLGFKN VPPVLCLGAD LKNTFCLVRG EQAVLSQHLG
DLSDDGIQMQ WREALRLMQN IYDFTPQYVV HDAHPGYVSS QWAREMNLPT QTVLHHHAHA
AACLAEHQWP LDGGDVIALT LDGIGMGENG ALWGGECLRV NYRECQHLGG LPAVALPGGD
LAAKQPWRNL LAQCLRFVPE WQNYSETASV QQQNWSVLAR AIERGINAPL ASSCGRFFDA
VAAALGCAPA TLSYEGEAAC ALEALAASCH GVTHPVTMPR VDNQLDLATF WQQWLNWQAP
VNQRAWAFHD ALAQGFAALM REQATMRGIT TLVFSGGVIH NRLLRARLAH YLADFTLLFP
QSLPAGDGGL SLGQGVIAAA RWLAGEVQNG