Gene CPR_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1007 
SymboldhaF 
ID4206514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1147962 
End bp1149812 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content33% 
IMG OID642565564 
Productglycerol dehydratase reactivation factor, large subunit 
Protein accessionYP_698330 
Protein GI110803450 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0315037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCA TAGCAGGAAT TGATATAGGA AACTCTTCTA CAGAAACTGC TTTAGGTAAG 
GTTTATGAAA ATAATGTTGA ATTTCTGTCT AGTGGGATAA TTCCCACTAC AGGAATCAAA
GGAACGGAAG AAAATATAAG TGGGGTAATA GCTTCTTTAA ACCAAGCTTT AAAGAAAGCT
AACTTAACTT TAGAAGATTT AGATTTAGTT AGAATTAACG AAGCAGCACC TGTTATAGGG
GATGTTGCTA TGGAGACAAT AACTGAAACA ATAATAACTG AATCAACAAT GATAGGACAT
AACCCTTCTA CTCCAGGAGG ATTAGGTATA GGAATAGGTA AAACTATAAG ATTAGAAACT
TTAGAAACTT TAAATATTGA TGAAATCAAA GAGGAAGATA ATGCTTTTAT TCCATTGGTT
TTAGGAAATA TAAGTTTTTT AGAGGCTGTA TTTAGAATAA ATCAAGCAAC TAGAAGAGGT
ATTAATATAA CTGCTGCCAT TGTTCAAAAG GATGATGGAG TATTAATTAA TAACAGATTA
GATAAAAAAA TACCTATAGT TGATGAAGTT TCTCTTTTAG AAAAGGTTCC TATAAATATG
AAAGCAGCTG TTGAAGTAGC ACCACAAGGT TCAGTAATTA GACAGCTATC AAATCCATAT
GGAATAGCTA CTGTTTTTGA CTTAAGTCCA GAAGAGACAA AAATGATTGT TCCTGTATCT
AGAGCCTTAA TAGGAAATAG ATCAGCTGTT GTAATAAAAA CTCCTCAAGG GGATGTTAAG
GAAAAGAAAA TACCAGCTGG TAAGATTAAC ATAAAAGGAA TAAGAAGAAA AGAGTCTGTG
GATGTTGAAG AGGGAGCAGA TAAAATAATG GAGGCTGTTA GCCTTTGCTC TCCAATAGAA
GATTTAAGAG GAGATGCTGG TTCTAATGTT GGAGGAATGC TTGAAAAAGT TAGACAAGTT
ATGGCTGATT TAACTAATCA AAGCATTTCA GATATAAAGA TTCAGGATTT ATTAGCAGTA
GATACTTTTA TCCCGCAAAA GGTTAAGGGT GGACTTGCAA AAGAGTTTTC AATGGAGAAT
GCTGTTGGAA TAGCAGCAAT GGTTAAAGCT CATAAACTTC AAATGCAAAT AATAGCTAAT
AAACTTGAAG AAAAGTTAGG AGTTCCAGTA GAAGTTGGTG GAGTAGAAGC TGACATGGCA
ATAAGAGGAG CTTTAACAAC TCCAGGTACA AATACTCCTT TAGCTATTTT AGATATGGGA
GCTGGATCTA CAGATGCTTC TATTATAAAT AAAGAAGGTA ACATAACATC AATACATTTA
GCTGGTGCAG GAAACATGGT AACTATGCTT ATTAAATCAG AATTAGGTAT AGAAGACTTT
GGACTTGCAG AAGATATAAA AAAATATCCA TTAGCTAAGG TTGAGAGTTT GTTCCATATA
AGACATGAAG ATGGAACTGT AGAGTTCTTT GAAAAACCTT TAGATTCTTC AGTTTTTGCT
AAGATTGTAA TTATTAAAGA GGGAATGCTT ATTCCTGTAG ATGGACAGAA TTCTTTAGAA
AAGATTAAAA ATGTTAGAAA AACTGCTAAG GAAAGAGTTT TCGTAATTAA CTGTTTAAGA
GCATTAAAAA GTGTTTCACC TACAGGAAAT ATAAGAGATA TAGAATTCGT TGTTTTAGTT
GGAGGATCAT CATTAGACTT TGAGGTTCCT GAATTAGTAA CAGATGCCTT ATCTCATTAT
GGAGTAGTTG CAGGAAGAGG AAATATAAGA GGATGTGAAG GTCCAAGAAA TGCAGTTGCA
ACTGGACTAG TATTAGCCTT TGACAGAAAG GGTGTCAAGG AAAATGATTA A
 
Protein sequence
MKIIAGIDIG NSSTETALGK VYENNVEFLS SGIIPTTGIK GTEENISGVI ASLNQALKKA 
NLTLEDLDLV RINEAAPVIG DVAMETITET IITESTMIGH NPSTPGGLGI GIGKTIRLET
LETLNIDEIK EEDNAFIPLV LGNISFLEAV FRINQATRRG INITAAIVQK DDGVLINNRL
DKKIPIVDEV SLLEKVPINM KAAVEVAPQG SVIRQLSNPY GIATVFDLSP EETKMIVPVS
RALIGNRSAV VIKTPQGDVK EKKIPAGKIN IKGIRRKESV DVEEGADKIM EAVSLCSPIE
DLRGDAGSNV GGMLEKVRQV MADLTNQSIS DIKIQDLLAV DTFIPQKVKG GLAKEFSMEN
AVGIAAMVKA HKLQMQIIAN KLEEKLGVPV EVGGVEADMA IRGALTTPGT NTPLAILDMG
AGSTDASIIN KEGNITSIHL AGAGNMVTML IKSELGIEDF GLAEDIKKYP LAKVESLFHI
RHEDGTVEFF EKPLDSSVFA KIVIIKEGML IPVDGQNSLE KIKNVRKTAK ERVFVINCLR
ALKSVSPTGN IRDIEFVVLV GGSSLDFEVP ELVTDALSHY GVVAGRGNIR GCEGPRNAVA
TGLVLAFDRK GVKEND