Gene EcolC_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1000 
Symbol 
ID6067708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1086637 
End bp1088889 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content58% 
IMG OID641600408 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_001723996 
Protein GI170019042 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000948603 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAAAA ACACATCTTG CGGTGTCCAA CTGCGTATTC GTGGCAAAGT GCAGGGCGTC 
GGTTTTCGTC CGTTTGTCTG GCAGCTGGCA CAGCAATTAA ATCTTCACGG CGATGTCTGT
AATGACGGCG ATGGCGTAGA AGTCCGGCTG CGGGAAGACC CGGAAACGTT TCTTGTTCAA
TTGTATCAGC ACTGCCCGCC GCTGGCGCGT ATTGATAGCG TCGAGCGTGA GCCGTTTATC
TGGTCACAAC TGCCCACCGA GTTCACTATA CGCCAGAGCA CAGGCGGCAC CATGAATACG
CAAATTGTTC CCGATGCCGC TACTTGCCCT GCTTGCCTTG CCGAAATGAA TACCCCAGGC
GAACGGCGTT ATCGTTATCC GTTTATCAAC TGTACCCACT GCGGCCCGCG TTTCACCATT
ATTCGCGCCA TGCCTTACGA CCGCCCGTTT ACCGTGATGG CGGCGTTTCC GCTATGTCCG
GCCTGTGACA AAGAGTACCG TGACCCGCTC GATCGTCGCT TCCACGCCCA GCCGGTGGCC
TGCCCGGAGT GTGGCCCGCA TCTTGAATGG GTAAGTCATG GTGAACATGC GGAACAAGAG
GCGGCATTAC AGGCAGCTAT CGCACAGTTA AAAATGGGCA AAATTGTCGC CATCAAAGGG
ATTGGCGGAT TTCATCTTGC CTGCGATGCA CGTAACAGTA ACGCGGTGGC GACACTTCGG
GCGCGCAAAC ATCGCCCGGC GAAACCGCTG GCGGTCATGT TGCCAGTGGC TGACGGTTTA
CCAGACGCTG CGCGCCAGTT GCTTACCACG CCCGCCGCGC CGATTGTGCT GGTGGATAAA
AAATACGTTC CTGAGCTTTG TGATGATATC GCCCCTGACC TTAACGAAGT CGGGGTAATG
TTGCCTGCGA ACCCGCTCCA GCATTTGCTG TTACAGGAAC TGCAATGCCC GCTGGTGATG
ACCTCCGGCA ACCTGAGCGG TAAACCACCA GCTATCAGCA ACAAACAGGC GCTGGCGGAT
TTGCAGGGCA TTGCCGACGG ATTTTTGATA CATAACCGCG ACATCGTGCA GCGGATGGAT
GATTCGGTGG TGCGCGAAAG CGGCGAAATG CTGCGCCGTT CGCGGGGGTA TGTGCCGGAT
GCGCTGGCTT TGCCTCCGGG CTTTAAAAAT GTTCCGCCTG TGCTGTGTCT CGGCGCGGAT
CTGAAAAATA CCTTCTGCCT GGTGCGCGGT GAACAAGCGG TGTTGAGTCA GCATCTGGGC
GATTTAAGTG ACGATGGCAT CCAGATGCAG TGGCGCGAAG CGTTACGCCT GATGCAAAAC
ATCTACGATT TCACCCCGCA ATACGTTGTG CATGACGCAC ATCCGGGCTA TGTCTCCAGC
CAGTGGGCGC GCGAAATGAA TCTGCCGACG CAAACGGTGC TGCATCATCA TGCCCACGCA
GCGGCGTGTC TGGCAGAGCA TCAGTGGCCG CTGGATGGCG GTGATGTCAT TGCTTTGACG
CTCGACGGTA TCGGTATGGG GGAGAATGGC GCTTTGTGGG GCGGCGAGTG CCTGCGGGTG
AACTATCGCG AATGTGAGCA CCTGGGCGGC TTGCCTGCAG TGGCGCTTGC TGGTGGCGAT
TTGGCGGCGA AGCAGCCGTG GCGAAACCTG CTGGCGCAGT GCCTGCGTTT TGTGCCGGAG
TGGCAGAATT ATCCCGAAAC AGCGAGTGTG CAACAGCAAA ACTGGAGCGT GCTGGCGCGG
GCCATTGAGC GTGGAATTAA CGCGCCGCTG GCGTCATCGT GTGGGCGTTT TTTCGATGCA
GTGGCGGCGG CACTGGGCTG TGCGCCAGCC ACGTTAAGTT ATGAAGGTGA AGCGGCTTGT
GCTCTGGAGG CGCTCGCAGC CTCATGCCAC GGAGTGACGC ATCCGGTGAC AATGCCGCGG
GTGGACAATC AACTGGATCT CGCCACTTTC TGGCAGCAGT GGCTGAACTG GCAGGCACCG
GTTAATCAAC GCGCGTGGGC GTTTCATGAT GCGCTGGCGC AGGGTTTTGC CGCGTTGATG
CGTGAGCAGG CCACGATGCG TGGTATCACT ACGCTGGTAT TTAGCGGCGG GGTTATTCAT
AACCGTTTGC TGCGTGCACG TCTGGCGCAT TATCTCGCTG ATTTCACATT GCTCTTTCCA
CAGAGTTTAC CGGCGGGTGA TGGCGGTTTG TCTCTGGGGC AGGGGGTTAT TGCTGCGGCG
CGTTGGTTAG CGGGTGAAGT CCAGAACGGA TAA
 
Protein sequence
MAKNTSCGVQ LRIRGKVQGV GFRPFVWQLA QQLNLHGDVC NDGDGVEVRL REDPETFLVQ 
LYQHCPPLAR IDSVEREPFI WSQLPTEFTI RQSTGGTMNT QIVPDAATCP ACLAEMNTPG
ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ACDKEYRDPL DRRFHAQPVA
CPECGPHLEW VSHGEHAEQE AALQAAIAQL KMGKIVAIKG IGGFHLACDA RNSNAVATLR
ARKHRPAKPL AVMLPVADGL PDAARQLLTT PAAPIVLVDK KYVPELCDDI APDLNEVGVM
LPANPLQHLL LQELQCPLVM TSGNLSGKPP AISNKQALAD LQGIADGFLI HNRDIVQRMD
DSVVRESGEM LRRSRGYVPD ALALPPGFKN VPPVLCLGAD LKNTFCLVRG EQAVLSQHLG
DLSDDGIQMQ WREALRLMQN IYDFTPQYVV HDAHPGYVSS QWAREMNLPT QTVLHHHAHA
AACLAEHQWP LDGGDVIALT LDGIGMGENG ALWGGECLRV NYRECEHLGG LPAVALAGGD
LAAKQPWRNL LAQCLRFVPE WQNYPETASV QQQNWSVLAR AIERGINAPL ASSCGRFFDA
VAAALGCAPA TLSYEGEAAC ALEALAASCH GVTHPVTMPR VDNQLDLATF WQQWLNWQAP
VNQRAWAFHD ALAQGFAALM REQATMRGIT TLVFSGGVIH NRLLRARLAH YLADFTLLFP
QSLPAGDGGL SLGQGVIAAA RWLAGEVQNG