Gene EcE24377A_2996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2996 
SymbolhypF 
ID5587796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2995842 
End bp2998094 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content58% 
IMG OID640926644 
Productcarbamoyltransferase HypF 
Protein accessionYP_001464020 
Protein GI157156292 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAA ACACATCTTG CGGTGTCCAA CTGCGTATTC GAGGCAAAGT GCAGGGCGTC 
GGTTTTCGTC CGTTTGTCTG GCAGCTGGCA CAGCAATTAA ATCTTCACGG CGATGTCTGT
AATGACGGCG ATGGCGTAGA AGTCCGGCTG CGGGAAGACC CGGAAACGTT TCTTGTTCAA
TTGCATCAGC ACTGCCCGCC GCTGGCGCGG ATTGATAGCG TCGAGCGTGA GCCGTACATC
TGGTCACAAC TGCCCACTGA GTTCACCATC CGCCAGAGCG CGGGAGGTGC CATGAATACG
CAAATTGTCC CGGATGCCGC TACTTGCCCT GCTTGCCTTG CCGAAATGAA TACCCCAGGC
GAACGGCGTT ATCGTTATCC GTTTATCAAC TGTACCCACT GCGGCCCGCG TTTCACCATT
ATTCGCGCCA TGCCTTACGA CCGCCCGTTT ACCGTGATGG CGGCGTTTCC GTTGTGTCCG
GCTTGTGACA AAGAGTACCG CGACCCGCTC GATCGTCGCT TCCACGCCCA GCCGGTGGCC
TGCCCGGAGT GTGGCCCGCA TCTTGAATGG GTAAGTCATG GTGAACATGC AGAACAAGAG
GCGGCATTAC AGGCGGCTAT CGCACAGTTA AAAATGGGCA ACATTGTCGC CATCAAAGGG
ATTGGCGGAT TTCATCTTGC CTGCGATGCA CGTAACAGTA ACGCGGTGGC GACACTTCGG
GCGCGCAAAC ATCGCCCGGC GAAACCGCTG GCGGTCATGT TGCCAGTGGC TGACGGTTTA
CCAGACGCTG CGCGCCAGTT GCTTACCACG CCCGCCGCGC CGATTGTGCT GGTGGATAAA
AAATACGTTC CTGAGCTTTG TGATGATATC GCCCCTGACC TTAACGAAGT CGGGGTAATG
TTGCCTGCGA ACCCGCTCCA GCATTTGCTG TTACAGGAAC TGCAATGCCC GCTGGTGATG
ACCTCCGGCA ACCTGAGCGG TAAACCACCA GCTATCAGCA ACGAACAGGC GCTGGCGGAT
TTGCAGGGCA TTGCCGACGG ATTCTTGATA CATAACCGCG ACATCGTGCA GCGGATGGAT
GATTCGGTGG TGCGCGAAAG CGGCGAAATG CTGCGCCGTT CGCGGGGGTA TGTGCCGGAT
GCGCTGGCTT TGCCTCCGGG CTTTAAAAAT GTTCCGCCTG TGCTGTGTCT CGGCGCGGAT
CTGAAAAATA CCTTCTGCCT GGTGCGCGGT GAACAAGCGG TGTTGAGTCA GCATCTGGGC
GATTTAAGTG ACGATGGCAT CCAGATGCAG TGGCGCGAAG CGTTACGCCT GATGCAAAAC
ATCTACGATT TTACCCCGCA ATACGTTGTG CATGACGCGC ATCCGGGCTA TGTCTCCAGC
CAGTGGGCGC GTGAAATGAA TCTGCCGACG CAAACGGTAC TGCATCATCA TGCCCATGCA
GCGGCGTGTC TGGCAGAGCA TCAGTGGCCG CTGGATGGCG GTGATGTCAT TGCTTTGACG
CTCGACGGTA TCGGTATGGG GGAGAACGGC GCTTTGTGGG GCGGCGAGTG CCTGCGGGTG
AACTATCGCG AATGTGAGCA CCTGGGCGGC TTGCCTGCAG TGGCGCTTCC GGGTGGCGAT
TTGGCAGCGA AGCAGCCGTG GCGAAACCTG CTGGCGCAGT GCCTGCGCTT TGTGCCGGAG
TGGCAGAATT ACCCTGAAAC AGCAAGTGTG CAACAGCAAA ACTGGAGCGT GCTGGCGCGG
GCCATTGAGC GTGGAATTAA CGCGCCGCTG GCGTCATCGT GTGGGCGGTT GTTCGATGCA
GTGGCGGCGG CACTGGGCTG TGCGCCAGCC ACGTTAAGTT ATGAAGGTGA AGCGGCTTGT
GCTCTGGAGG CGCTCGCAGC CTCATGCCAC GGAGTGACGC ATCCGGTGAC GATGCCGCTG
GTGGACAATC AACTGGATCT CGCCACTTTC TGGCAGCAGT GGCTGAACTG GCAGGCACCG
GTTAATCAAC GCGCGTGGGC GTTTCATGAT GCGCTGGCGC AGGGTTTTGC CGCGTTGATG
CGTGAGCAGG CCACGATGCG TGGTATCACT ACGCTGGTAT TTAGCGGCGG GGTTATTCAT
AACCGTTTGC TGCGTGCACG TCTGGCGCAT TATCTCGCTG ATTTCACATT GCTCTTTCCA
CAGAGTTTAC CGGCGGGTGA TGGCGGTTTG TCTCTGGGGC AGGGGGTTAT TGCTGCGGCG
CGTTGGTTAG CGGGTGAAGT CCAGAACGGA TAA
 
Protein sequence
MAKNTSCGVQ LRIRGKVQGV GFRPFVWQLA QQLNLHGDVC NDGDGVEVRL REDPETFLVQ 
LHQHCPPLAR IDSVEREPYI WSQLPTEFTI RQSAGGAMNT QIVPDAATCP ACLAEMNTPG
ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ACDKEYRDPL DRRFHAQPVA
CPECGPHLEW VSHGEHAEQE AALQAAIAQL KMGNIVAIKG IGGFHLACDA RNSNAVATLR
ARKHRPAKPL AVMLPVADGL PDAARQLLTT PAAPIVLVDK KYVPELCDDI APDLNEVGVM
LPANPLQHLL LQELQCPLVM TSGNLSGKPP AISNEQALAD LQGIADGFLI HNRDIVQRMD
DSVVRESGEM LRRSRGYVPD ALALPPGFKN VPPVLCLGAD LKNTFCLVRG EQAVLSQHLG
DLSDDGIQMQ WREALRLMQN IYDFTPQYVV HDAHPGYVSS QWAREMNLPT QTVLHHHAHA
AACLAEHQWP LDGGDVIALT LDGIGMGENG ALWGGECLRV NYRECEHLGG LPAVALPGGD
LAAKQPWRNL LAQCLRFVPE WQNYPETASV QQQNWSVLAR AIERGINAPL ASSCGRLFDA
VAAALGCAPA TLSYEGEAAC ALEALAASCH GVTHPVTMPL VDNQLDLATF WQQWLNWQAP
VNQRAWAFHD ALAQGFAALM REQATMRGIT TLVFSGGVIH NRLLRARLAH YLADFTLLFP
QSLPAGDGGL SLGQGVIAAA RWLAGEVQNG