Gene SbBS512_E3165 details

Gene Information

Locus tag: SbBS512_E3165
Symbol: hypF
ID: 6270753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 2955406
End bp: 2957658
Gene Length: 2253 bp
Protein Length: 750 aa
Translation table: 11
GC content: 58%
IMG OID: 641727081
Product: carbamoyltransferase HypF
Protein accession: YP_001881540
Protein GI: 187730350
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0068] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00143] [NiFe] hydrogenase maturation protein HypF
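
The length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2955406, 2957658)
print(length)                     # 2253
print(protein_length_aa(length))  # 750
```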


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA ACACATCTTG CGGTGTCCAA CTGCGTATTC GTGGCAAAGT GCAGGGCGTC 
GGTTTTCGTC CGTTTGTCTG GCAGCTGGCA CAGCAATTAA ATCTTCACGG CGATGTCTGT
AATGACGGCG ATGGCGTAGA AGTCCGGCTG CGGGAAGACC CGGAAACGTT TCTTGTTCAA
TTGCATCAGC ACTGCCCGCC GCTGGCGCGT ATTGATAGCG TCGAGCGTGA GCCGTACATC
TGGTCACAAC TGCCCACTGA GTTCACCATC CGCCAGAGCG CGGGAGGTGC CATGAATACG
CAAATTGTCC CGGATGCCGC TACTTGCCCT GCTTGCCTTG CCGAAATGAA TACCCCAGGC
GAACGGCGTT ATCGTTATCC GTTTATCAAC TGTACCCACT GCGGCCCGCG TTTCACCATT
ATTCGCGCCA TGCCTTACGA CCGCCCGTTT ACCGTGATGG CGGCGTTTCC GTTGTGTCCG
GCTTGTGACA AAGAGTACCG CGACCCGCTC GATCGTCGCT TCCACGCCCA GCCGGTGGCC
TGCCCAGAGT GTGGCCCGCA TCTTGAATGG GTAAGTCATG GTGAACATGC AGAACAAGAG
GCGGCATTAC AGGCGGCTAT CGCACAGTTA AAAATGGGCA ACATTGTCGC CATCAAAGGG
ATTGGCGGAT TTCATCTTGC CTGCGATGCA CGTAACAGTA ACGCGGTGGC GACACTTCGG
GCGCGCAAAC ATCGCCCGGC GAAACCGCTG GCGGTCATGT TGCCAGTGGC TGACGGTTTA
CCAGACGCTG CGCGCCAGTT GCTTACCACG CCCGCCGCGC CGATTGTGCT GGTGGATAAA
AAATACGTTC CTGAGCTTTG TGATGATATC GCCCCTGACC TTAACGAAGT CGGGGTAATG
TTGCCTGCGA ACCCGCTCCA GCATTTGCTG TTACAGGAAC TGCAATGCCC GCTGGTGATG
ACCTCCGGCA ACCTGAGCGG TAAACCACCA GCTATCAGCA ACGAACAGGC GCTGGCGGAT
TTGCAGGGCA TTGCCGACGG ATTCTTGATA CATAACCGCG ACATCGTGCA GCGGATGGAT
GATTCGGTGG TGCGCGAAAG CGGCGAAATG CTGCGCCGTT CGCGGGGGTA TGTGCCGGAT
GCGCTGGCTT TGCCTCCGGG CTTTAAAAAT GTTCCGCCTG TGCTGTGTCT CGGAGCGGAT
CTGAAAAATA CCTTCTGCCT GGTGCGCGGT GAACAAGCGG TGTTGAGTCA GCATCTGGGC
GATTTAAGTG ACGATGGCAT CCAGATGCAG TGGCGCGAAG CGTTACGCCT GATGCAAAAC
ATCTACGATT TTACCCCGAA ATACGTTGTG CATGACGCGC ATCCGGGCTA TGTCTCCAGC
CAGTGGGCGC GTGAAATGAA TCTGCCGACG CAAACGGTAC TGCATCATCA TGCCCATGCA
GCGGCGTGTC TGGCAGAGCA TCAGTGGCCG CTGGATGGCG GTGATGTCAT TGCTTTGACG
CTCGACGGTA TCGGTATGGG GAAGAACGGC GCTTTGTGGG GCGGCGAGTG CCTGCGGGTG
AACTATCGCG AATGTGAGCA CCTGGGCGGC TTGCCTGCAG TGGCGCTTCC GGGTGGCGAT
TTGGCAGCGA AGCAGCCGTG GCGAAACCTG CTGGCGCAGT GCCTGCGCTT TGTGCCGGAG
TGGCAGAATT ACCCTGAAAC AGCAAGTGTG CAACAGCAAA ACTGGAGCGT GCTGGCGCGG
GCCATTGAGC GTGGAATTAA CGCGCCGCTG GCGTCATCGT GTGGGCGGTT GTTCGATGCA
GTGGCGGCGG CACTGGGCTG TGCGCCAGCC ACGTTAAGTT ATGAAGGTGA AGCGGCTTGT
GCTCTGGAGG CGCTCGCAGC CTCATGCCAC GGAGTGACGC ATCCGGTGAC GATGCCGCTG
GTGGACAATC AACTGGATCT CGCCGCTTTC TGGCAGCAGT GGCTGAACTG GCAGGCACAG
GTTAATCAAC GCGCGTGGGC GTTTCATGAT GCGCTGGCGC AGGGTTTTGC CGCGTTGATG
CGTGAGCAGG CCACGATGCG TGGTATCACT ACGCTGGTAT TTAGCGGCGG GGTTATTCAT
AACCGTTTGC TGCGTGCACG TCTGTCGCAT TATCTCGCTG ATTTCACATT GCTCTTTCCT
CAGAGTTTAC CGGCGGGTGA TGGCGGTTTG TCTCTGGGGC AGGGGGTTAT TGCTGCGGCG
CGTTGGTTAG CGGGTGAAGT CCAGAACGGA TAA
 
Protein sequence
MAKNTSCGVQ LRIRGKVQGV GFRPFVWQLA QQLNLHGDVC NDGDGVEVRL REDPETFLVQ 
LHQHCPPLAR IDSVEREPYI WSQLPTEFTI RQSAGGAMNT QIVPDAATCP ACLAEMNTPG
ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ACDKEYRDPL DRRFHAQPVA
CPECGPHLEW VSHGEHAEQE AALQAAIAQL KMGNIVAIKG IGGFHLACDA RNSNAVATLR
ARKHRPAKPL AVMLPVADGL PDAARQLLTT PAAPIVLVDK KYVPELCDDI APDLNEVGVM
LPANPLQHLL LQELQCPLVM TSGNLSGKPP AISNEQALAD LQGIADGFLI HNRDIVQRMD
DSVVRESGEM LRRSRGYVPD ALALPPGFKN VPPVLCLGAD LKNTFCLVRG EQAVLSQHLG
DLSDDGIQMQ WREALRLMQN IYDFTPKYVV HDAHPGYVSS QWAREMNLPT QTVLHHHAHA
AACLAEHQWP LDGGDVIALT LDGIGMGKNG ALWGGECLRV NYRECEHLGG LPAVALPGGD
LAAKQPWRNL LAQCLRFVPE WQNYPETASV QQQNWSVLAR AIERGINAPL ASSCGRLFDA
VAAALGCAPA TLSYEGEAAC ALEALAASCH GVTHPVTMPL VDNQLDLAAF WQQWLNWQAQ
VNQRAWAFHD ALAQGFAALM REQATMRGIT TLVFSGGVIH NRLLRARLSH YLADFTLLFP
QSLPAGDGGL SLGQGVIAAA RWLAGEVQNG
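
The sequences above are displayed in groups of 10 characters, 60 per line. A minimal sketch for turning such a display back into a contiguous string and running a basic sanity check follows. The helper names are hypothetical, and the check is deliberately simplified: it looks only for an ATG start and for the stop codons TAA, TAG, or TGA of translation table 11, ignoring the alternative start codons that table also allows.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence display."""
    return "".join(block.split()).upper()


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: ATG start, length divisible by 3, terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First and last display lines of the gene sequence, used here as excerpts only.
first_line = "ATGGCAAAAA ACACATCTTG CGGTGTCCAA CTGCGTATTC GTGGCAAAGT GCAGGGCGTC"
last_line = "CGTTGGTTAG CGGGTGAAGT CCAGAACGGA TAA"

print(len(clean_sequence(first_line)))                # 60
print(clean_sequence(last_line)[-3:] in STOP_CODONS)  # True
```

To check the full gene, pass the whole "Gene sequence" block through `clean_sequence` and compare its length with the Gene Length field.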