Gene Haur_3029 details

Gene Information

Locus tag: Haur_3029
Symbol:
ID: 5734886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3824907
End bp: 3826862
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 52%
IMG OID: 641280173
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_001545795
Protein GI: 159899548
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
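
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3824907
END_BP = 3826862


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length = gene_length(START_BP, END_BP)
print(length)                  # 1956
print(protein_length(length))  # 651
```

Both results match the listed Gene Length (1956 bp) and Protein Length (651 aa).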


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0253083
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTGAAA ATCGCTTCTT ACGCAACAGT TTTGTGTATT TGATCATCGT AGTGGCGGCG 
TTGGCGCTCT TCTATCAATA CATCAATGGT AGTCCTGGCA ATACCAGCGA AAAGAGCTAC
AGTGAAATTC TCGACTTAGC TCTCCAAGGT AAAATTGCCG AGATCGTCCA AACCGAAGGT
CAAGTCGAGT TTCGGGCGAC CACCAACGAC GCGCCACCCA CCACCTATAT TTCTCGCAAG
AACAGCACGA CCGAAAGTAT TGAAGAAATT CTTTCGCAAA AAGCCCAAGA TGCTAGCAAA
TTAGCCGAGC TTAAAGATCC TGAGAATAAG ACGGCAATTG CAGCACCGTT GGAAAACGCT
GCCAAAGTTA AGTTCAACCC TAAATCGGCT CCAGCTTGGG GCGGTATTTT AGGCGCGGCC
TTGACCTTCT TGCTGCCCAC GCTTTTGCTG ATTGGCTTTT TCGTCTTTTT CATGCGCCAA
GCCCAAGGCT CCAACAACCA AGCGATGTCG TTTGGCAAGA GCAAAGCTCG CATGTTCACT
GGCGACAAAC CATCGGTAAC CTTCGCTGAT GTTGCTGGCC AAGAAGAAGC CAAACAAGAC
TTAACCGAAG TTGTCGAGTT TCTTAAATTT CCTGAGAAAT TTGCCCAACT TGGTGCACGG
ATTCCACGTG GGGTTTTGAT GGTTGGCCCT CCAGGTACTG GTAAAACTTT GCTCAGCCGT
GCGGTTGCTG GCGAAGCAGG CGTGCCATTC TTCTCAATCA GCGGTTCGGA ATTCGTGGAA
ATGTTCGTCG GGGTTGGCGC AAGCCGCGTG CGCGATTTGT TTGAACAAGC CAAACGCAAC
GCTCCTTGTA TCGTCTTTAT CGACGAAGTT GATGCCGTTG GTCGCCAACG CGGTGCTGGC
CTTGGTGGCT CGCACGACGA ACGCGAACAA ACGCTTAACC AAATCTTGGT TGAAATGGAC
GGCTTCGATA GCAATACCAA TGTGATTGTG ATCGCCGCCA CCAATCGCCC TGACGTGCTC
GACCCAGCCT TGGTTCGCCC AGGCCGTTTC GACCGCCAAG TTGTGCTTGA TGCTCCTGAT
ATGCGCGGAC GGGTCGAAGT GCTCAAGGTC CATACCAAGG GCAAACCACT CTCCGAAGAT
GTTAATTTGG AAGCAATTGC CAAATTAACG CCTGGCTCAT CGGGTGCAGA CCTCGCTAAC
ATCGTCAACG AAGCTGCGAT CTTGGCTGCA CGGCGCTCGA AAAAACGCAT CGCCATGCAA
GAAATGCAAG ATGCCACCGA ACGAATTATG CTTGGTGGCC CAGAACGCCG CTCACGCGTG
ATGACTCCTA AGCAAAAAGA GTTGACTGCC TTCCACGAAG CTGGTCACGC GATTGTTGCT
AAAGCGATGC CTGGCGCAAA CCCTGTGCAT AAAGTGACGA TTATTCCTCG TGGGATGGCT
GGTGGCTACA CCTTGATGAT CCCCGATGAA GATCAAAGCT ATATGAGCGT TTCGCAGTTT
GAAGCCCAAA TTGCTGTAGC CCTTGGTGGT CGCGCCGCCG AAGAATTGGT GTTGAGCGAC
TTTACCACTG GCGCTTCGGG CGATATTCAA CAAGTGACGC GCATGGCACG AGCGATGGTT
ACCCGCTATG GGATGAGTTC AGAGCTGGGG CCAATTGCCT TTGGCGAAAA AGAAGAGTTG
ATCTTCTTGG GCCGCGAAAT CAGCGAACAA CGCAACTATT CGGAAGAAAC CTCACGCAAG
ATCGATTCGG AAGTGCGGCG TTTGGTCAGC GAAGGCCACG AACGCGCCCG CGCAATCTTG
GAACGCAACC GCGAGGTGAT GAACCGCATG GCCGAAGCCT TGATCGAGCA TGAAAACCTC
GATGGTGAGC CATTGCGCCA ATTGCTCGAC GAAGTGATTA AGTATAACTC CAACAATGGG
GTTTATAATG ACGCTTTGCC CAAGCAACGC ATCTAA
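
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be checked against the listed fields. A minimal sketch, with illustrative helper names, demonstrated on the first line of the sequence only:

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste the full block to check the whole gene.
first_line = "ATGGGTGAAA ATCGCTTCTT ACGCAACAGT TTTGTGTATT TGATCATCGT AGTGGCGGCG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.1%}")  # GC content of this first line only
```

Applied to the full sequence, `len` should agree with the Gene Length field and `gc_fraction` with the rounded GC content field.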
 
Protein sequence
MGENRFLRNS FVYLIIVVAA LALFYQYING SPGNTSEKSY SEILDLALQG KIAEIVQTEG 
QVEFRATTND APPTTYISRK NSTTESIEEI LSQKAQDASK LAELKDPENK TAIAAPLENA
AKVKFNPKSA PAWGGILGAA LTFLLPTLLL IGFFVFFMRQ AQGSNNQAMS FGKSKARMFT
GDKPSVTFAD VAGQEEAKQD LTEVVEFLKF PEKFAQLGAR IPRGVLMVGP PGTGKTLLSR
AVAGEAGVPF FSISGSEFVE MFVGVGASRV RDLFEQAKRN APCIVFIDEV DAVGRQRGAG
LGGSHDEREQ TLNQILVEMD GFDSNTNVIV IAATNRPDVL DPALVRPGRF DRQVVLDAPD
MRGRVEVLKV HTKGKPLSED VNLEAIAKLT PGSSGADLAN IVNEAAILAA RRSKKRIAMQ
EMQDATERIM LGGPERRSRV MTPKQKELTA FHEAGHAIVA KAMPGANPVH KVTIIPRGMA
GGYTLMIPDE DQSYMSVSQF EAQIAVALGG RAAEELVLSD FTTGASGDIQ QVTRMARAMV
TRYGMSSELG PIAFGEKEEL IFLGREISEQ RNYSEETSRK IDSEVRRLVS EGHERARAIL
ERNREVMNRM AEALIEHENL DGEPLRQLLD EVIKYNSNNG VYNDALPKQR I
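
The protein sequence can be reproduced from the gene sequence by codon-wise translation. Translation table 11 assigns the same amino acids to codons as the standard code; it differs in which codons may serve as start codons, which does not matter here because the gene begins with ATG. The sketch below is an illustration with hypothetical names, not the database's own pipeline, and it stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGGTGAAA ATCGCTTCTT ACGCAACAGT"))  # MGENRFLRNS
```

The output matches the first ten residues of the protein sequence listed above.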