Gene Haur_4549 details


Gene Information

Locus tag: Haur_4549
Symbol:
ID: 5736394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5821936
End bp: 5824524
Gene Length: 2589 bp
Protein Length: 862 aa
Translation table: 11
GC content: 52%
IMG OID: 641281711
Product: CHRD domain-containing protein
Protein accession: YP_001547308
Protein GI: 159901061
COG category:
COG ID:
TIGRFAM ID:
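
The start, end, and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length (the gene sequence in this record does end in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 5821936, 5824524
length = gene_length(start_bp, end_bp)
print(length)                  # 2589
print(protein_length(length))  # 862
```

Both results agree with the Gene Length and Protein Length fields of the record.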


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGAATC GGTTTCGAAC ACCTCGCATG TTCTCGTTCG TAACATTGCT CTCGCTATTT 
AGCCTTGTCG TTGGTCTAAA CTTAGCCCAA GCAGCCCCGC GCACCACTCA AGGCGGTGGT
GGCTTTACCC CAGGCAATTT GGTGGTAGTA CGGGTTGGTA ATGGCAGTGG CACGCTTTCA
AATGCTGCAA CTCCAGTTTT TCTCGATGAG TACACCCCCA ACGGCGATTT TGTGCAAGCT
GTTGCTATGC CAACTGCCTT GAATGGCAAT AATCGCCGCC TTACCATGTC GGGTTCGGCA
ACCTCGGAGG GTGCACTCAG CCTTTCAGGC GATGGTCAGT GGTTATTGCT AGCAGGCTAC
GATGCTGATG TTGGTACGAC GAGTGTTGCA TCAAGCAATA GTGCCACGGT CAATCGGGTG
GTTGCCAAAG TCAGCCTCAG TAGCCTAATT GATACAACCA ACGTGATTAC TGATGCCTAT
AGCGCTAACA ATATTCGTGG CGCAACAGGT AACGGCGACA ACCTTTGGGC AACCGGAACT
GGTAATCCGG GCGGAGTACG TTTTCTGAAT GGGATGGGCA CAACCACAAC CTTGATCACC
CCAACCAACA CCCGCGTTGT ACATACCTAT GGTGGTAATC TCTACTTCTC GTCATCATCA
GCTACATCGC GTGGTATTTT CCAAATTGGC AATGGCCTGC CAACCAGCGC AGGCCAAACG
ATTACGCCAA TCGCTAGCGC AACCTCGCCC TATGGCTTTG TCTTTTTAGA TCGTGAGCCA
AGTGTGGCAG GCGTTGACAC GTTGTATGTG GTTGATGATA CATCAACAAA CTTGCGTAAA
TTCTCCTTCG ATGGCACAAC CTGGACATTG CAAGGTGTTT GGGCTGAGCC AAGCACTGGG
TTTCATATTG CAGCCCAAGA TAATGGAGCC AACGGCGTTG ATTTATACCT GACTCGCGGA
ACCAATAGCC TGAGCAAATT GACCGATACC GCAGCTTATA ACCAACCAAT TAATGCTGCG
GCCCTAAATC CCATCGTTCC GGTTGGGGCC AACACAGCGT TTCGTGGAAT TATCGTTGTG
CCCAGCCCAG TTCAAGCTAC GCCAACTCCC ACTGAAACTG CGACAGTAAC GCCGAGCGAA
ACCCCATCGC CAACCGCCGT GCCAAGCTGT TTGCGCTTTA TGGTTAGTTT GGAAGGCTCG
CAAGAAGTAC CACCAAGTGG CAGCAATGCA ACTGGCGGTG GCACAGTCGA TGTTGATACC
GTCAACAATA TTTTGAGCTA TAACCTCAGC TACCAAGGCT TGAGCGGCAC CGAAACGGCT
GCGCATATTC ACGGTTTTGC TCCACGCGGA GCCAATGCTG GCGTTTTAGT TGGCTTGAGC
ACTGGTTCGC CCAAAGTTGG CATCTTCAAC TACAGCGAAG CCCAAGAAGC CAATATTTTG
GCCGGCCAAG CCTACGTCAA TATTCATACC GATAGTTTCC CTGGTGGCGA AATTCGCGGT
CAAATCGATG GTGCAACGAT TAATTGTCCA CCGCCAACCG CTACGCCAAC CGAAACGACT
ACTCCAACTG CCACTGAAAC CCCATCTGCT ACGCCAATTC CAAGCTGTTT GCGCTTTATG
GTTAGTTTGG AAGGTTCGCA GGAAGTGCCG CCAAGCGGCA GCACTGCAAC TGGCGGCGGC
ACAGTCGATG TTGATACCGT CAACAATATT TTGAGCTATA ACCTCAGCTA CCAGGGCTTG
AGCGGCACCG AAACGGCTGC GCATATTCAC GGTTTTGCCC CACGTGGAGC CAATGCTGGA
GTCTTAATTG GCTTGAGCAC TGGTTCGCCC AAAGTTGGCA CCTTCAACTA CGCTGAGAAT
CAAGAAGCCA ATATTTTGGC CGGCCAAACC TACGTTAATA TTCATACCGA TAGTTTCCCT
GGTGGCGAAA TTCGCGGCCA AATCGATGGT GCGACGATTA ATTGTCCACC GCCAACCGCA
ACCCCGACCG AAACTGTTAC GCCAAGCGCT ACGGCAACTG CGACCGCAAC CGCAACAGGC
ACGGCCACAC CAAGCCTAAC CCCAATCGTT ACACCACCAC CAAGCTGTAT CACCATGACG
GTAAGTGGCA GTGGTTCGCA AGAAGTACCA CCGAATAACA GCAATGGCAT GGTGATGGGC
ATGATTGAAA TTAATACGGT TGCCAACACG ATCAACTACA ATCTCAGCTA CCACGATTTG
AGCAGCGCCG AAACTGCCGC GCATATTCAT GGTTTTGCGC CGCGTGGCTC GAATGCTGGC
GTATTATTCA ACTTGCCGCT TGGTGCTAGC AAAGTTGGCT CGGTCAGCTA CGCCGAAAAT
CAAGAAGCCA ATATTTTGGC CGGCCAGACC TACATTAATA TTCATAGCAG CAATTTCCCG
GGTGGCGAGC TTCGCGCCCA GCTTGATGGT GCAACCGCCT TATGTGCCAC CCCAACACCT
ACGGCGACAG GGACTGCTAC GAACACGCCA ACCAACACAG CGACGGCGAC AGTGACCAAT
ACCCCAACTA ATACCGCCAC TGCTGTGCCG CCAACCTTTA GAATCTACCT GCCATTGACC
ATGAAATAA
 
Protein sequence
MLNRFRTPRM FSFVTLLSLF SLVVGLNLAQ AAPRTTQGGG GFTPGNLVVV RVGNGSGTLS 
NAATPVFLDE YTPNGDFVQA VAMPTALNGN NRRLTMSGSA TSEGALSLSG DGQWLLLAGY
DADVGTTSVA SSNSATVNRV VAKVSLSSLI DTTNVITDAY SANNIRGATG NGDNLWATGT
GNPGGVRFLN GMGTTTTLIT PTNTRVVHTY GGNLYFSSSS ATSRGIFQIG NGLPTSAGQT
ITPIASATSP YGFVFLDREP SVAGVDTLYV VDDTSTNLRK FSFDGTTWTL QGVWAEPSTG
FHIAAQDNGA NGVDLYLTRG TNSLSKLTDT AAYNQPINAA ALNPIVPVGA NTAFRGIIVV
PSPVQATPTP TETATVTPSE TPSPTAVPSC LRFMVSLEGS QEVPPSGSNA TGGGTVDVDT
VNNILSYNLS YQGLSGTETA AHIHGFAPRG ANAGVLVGLS TGSPKVGIFN YSEAQEANIL
AGQAYVNIHT DSFPGGEIRG QIDGATINCP PPTATPTETT TPTATETPSA TPIPSCLRFM
VSLEGSQEVP PSGSTATGGG TVDVDTVNNI LSYNLSYQGL SGTETAAHIH GFAPRGANAG
VLIGLSTGSP KVGTFNYAEN QEANILAGQT YVNIHTDSFP GGEIRGQIDG ATINCPPPTA
TPTETVTPSA TATATATATG TATPSLTPIV TPPPSCITMT VSGSGSQEVP PNNSNGMVMG
MIEINTVANT INYNLSYHDL SSAETAAHIH GFAPRGSNAG VLFNLPLGAS KVGSVSYAEN
QEANILAGQT YINIHSSNFP GGELRAQLDG ATALCATPTP TATGTATNTP TNTATATVTN
TPTNTATAVP PTFRIYLPLT MK
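
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that in Python and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input, so the printed GC value describes that line alone and not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = (
    "GTGCTGAATC GGTTTCGAAC ACCTCGCATG "
    "TTCTCGTTCG TAACATTGCT CTCGCTATTT"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to the full gene sequence would allow a comparison with the Gene Length and GC content fields reported in the record.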