Gene Haur_3683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3683 
Symbol 
ID5735562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4633929 
End bp4637033 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content53% 
IMG OID641280835 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001546447 
Protein GI159900200 
COG category[R] General function prediction only 
COG ID[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00198659 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTTG CCCTGCGTTT CCTCGGGGTT CCTTCAATCC TCTACAATCA GCAAGCACAG 
TCGCTTCCCA GCAAAGCGGT AGCCCTCTTG GGGTATCTTG CAGCAACCAT CCAACCTCAA
CGCCGCGAAC ATCTTCTGGC ATTGCTGTGG GCTGAAAGTA GCGATGAAGC CGCCCGCAAA
AATTTACGCA ATACGCTATG GACAATTCGG CGCAGTTTAG GTGGTGAGAT CATCTTAGCC
GATGAGGATC GCTTGCGTTT GCATCCGGAT TGTGCGGTTG ATCTTTGGCA GTTACGCAGT
TTGGCAGAAG GTTTGGTTTC ACCAACCGTT GAGCGTTTGT TGCAGCTTGC CCAAGGGCCA
TTGCTCGATG GCGTGATGTT ACCCGACGCT CCCGATTTTG ATTTATGGCT GGTGACCGAG
CGCGAACAGG TGCATTTAAC CACGATGCGC TTATTTCAAA CCGCGATCAG CAATTATCAA
AAACAACAGC AATGGGTGCA TGTGTTGACT TTAGCTCGCG CCGCCCTGCG TTTCGATCCG
CTCTCCGAAT CGTTGTATCA GGCGATTATC GAGGCCCATT GGCGGCAAGG TGAACGCGCC
GAGGCCTTGC GCCAATATGA GATGTTGGCC AATACGCTGC AGCGTGAGTT AGGGATCGAT
CCCTTGCCTG AAACCCAAAT GCTGCGCCAG CGCATTTTGC AAACCAACCA ACTGATGAGT
ACAAACAACG CATCAGAACC AGCCAAACCT GTGGTCACAG CGCCGATTGC TGAGCCAAAG
CCCTTGCTCA AGCCAATTAA ACGCGCCGCC CCCCGCGCCG CGCCCTTTGT TGGGCGGCAA
TATGAGCAAT TACTGCTTAA CCAAGCCTTG ATGCGCAGCA ACGAACGGGG CTTGCAAGTG
GTGCTCATTA CTGGCGAAAT TGGGGTTGGC AAATCGCGGC TCTGGCAGGA ATGGGCACAG
CAATTACCTG CTGATAGCAC CATGTTGGCG ATGCACTGCC TTGAATCGAC CCAAAGCTTA
CCGTTTGCCC CGTTGACCGA ACTATTTGGT CAGCGCATCT GCCTATCACG GCTTTTTTCG
GGCGATTCAG CGGTTGATCC GATGTGGTTG GCCGAGGTAG CGCGATTACT GCCGCAATTG
CGTGGGCATA TTCCCAATTT GCCAGAGCCA CCAGTGTTAC CCGCCGATGA AGAGCGACGA
CGTATGTTCG AGGCTTTTGC CCAATGTTTA CGGGCAGTAA TGACCAATCA TCTGGTAATT
GGCATCGACG ATTTACACTG GGCCGACCAA GCCACCGCCG AATGGCTCGA TTACGTCGTT
GATCGCTTGC ACGATTTGCC GATTGTGTTG GTGGTGACCT ATCGTTCGGA AGAGGCCAAC
GCCCGTTTGC AACGCTTGGT GGTTGGTTGG CAACGCGGCA GTTTGGTCAC CCGCATCGAC
GTGCCACGGC TCGATCTCAG CGAAGCACGT GAACTTTTGG GTGCATTGGG TCATCCTAAT
CCTGATGATC AGACCTTGCT CGAACGTAGC GCGGGTAATC CATTATTTTT GCTCGAACTT
TGCCGCACGG CTGATCCGGC TGATGTGCCG CCGATGTTGG TCGATGTGCT CAGCGCTCGT
TTGGCGCGTC TGCCAGAACG AGCAGCCCAA ATTGTCCAAG CGGCAGCGGT GCTTGACCCC
TTGTTTGGCT ATAGTATTCT GCGCGAAACT GGCGGACGTT CCGATGAGGA AACCCTCGAT
GGGCTTGATG GGTTGCTCAG TGCCGCTATC TTAAAAGAGC ACGGCGATAG CTATACCTTT
AGTCATCCCT TGGTAGCAAC CGTTGTCCGT AATAGCCTCA GTCGCGCCCG CCGCAGCTTT
TTGCATCGCC GCGCCGCCCA AGCTCTACAA CACGAATATA GCGATCAACG GGCGATTGCA
GGACGCTTGC TATTGCATTA TCGCGAGGCG GGCGAGGCCA AATTGGCTGC CAGTTATGCC
GACCAAGCTC TAGAGCACGC GCTTTCATTG GCAGCACCGA ATGAGGCTGT GGCTTTTGGG
CAACAGGCGG TTGAGCTTGA TCCAACGCCT GAGCGCTATT GTCGGCTGGG CGATGCCTTG
GAGTGGAATA GCGAAGTGCC TGCGGCGCGT GAGGTTTATC ATACGGCGCT GGCGCAATAT
CAAGCCCAAG CCAATTGGCT TCGAGTTGTG GCAGTGTGCA CCAAGCTTGG CCGCACCTAT
TTGGTCGTCG GCAGGCCTGA GGTGGTGATC GAATGGGCGG AACGGGGCTT GGCAATTTAT
CATAAACACA AGATTGATGA TCAATTGATC GAAGCCGATT TGCTGTTGCT CTTAGCGATT
AGCCAACGTT TGGCGGGCTA CCCATTAAAC GTAGCCTACG AAAATATTCA AGCCGCCTTA
GAGGTTGCGA CCGCCCAGCA AAATCACTCG TTGATTGGGC GTTGTCAATT TGAGTTGGGC
AATATTTTGG CGCAACGCGG CGAAATTAAG GCAGCGGTCG AAACTTTTGC CTTGGCGATT
GCCAGCACCG CCGCCACCAA CGAGCAATAT CAAATTATCT TGGGCTATAA CAATGCTGCC
TATAATGCCA CCTTGATCGG CGATTTGGTC ACGGCGCATA GCCATATTCA AGCAGGCTTG
CACTTGGCCG AACGCTTGGC CTTGCGCGTG CCATTGCAAT ATTTGTATAG CACACGCGGC
GAAATTGCCT TAGCCGAAAA GCATTGGGAC GAGGCCGAAG AATGGTTTGA ACGCGGAATT
AGCGTGGCTC AAGCCAATGG TAATGCGGCT CAAGTTGCCA ATTATCGTGC TAATTTGGGC
TTGGTGGCGC GTGGCCGTGG CGATCTTGAT CAAGCGTTGG TCTTGATGCG CACAGCACTC
AGCGAAGTTG AATTGCTCAC TGCGCCATTT TTACAAACCC AAATCAACAT TTGGCTGGCC
GAAATTTGGC AAGAACGCCA CGATTATCTA GCTGCCAATG CTGCCTTACA ACGGGCACAA
CAGGCCGTCA CACCTGAACA AGGCTTTTTG TATCAACGGG TTCAGCAATT ACAACAACAA
CTTGGCAGCA TGCAACCAGC AATGGTTCAT CAGCGTTCTA TTTAA
 
Protein sequence
MTVALRFLGV PSILYNQQAQ SLPSKAVALL GYLAATIQPQ RREHLLALLW AESSDEAARK 
NLRNTLWTIR RSLGGEIILA DEDRLRLHPD CAVDLWQLRS LAEGLVSPTV ERLLQLAQGP
LLDGVMLPDA PDFDLWLVTE REQVHLTTMR LFQTAISNYQ KQQQWVHVLT LARAALRFDP
LSESLYQAII EAHWRQGERA EALRQYEMLA NTLQRELGID PLPETQMLRQ RILQTNQLMS
TNNASEPAKP VVTAPIAEPK PLLKPIKRAA PRAAPFVGRQ YEQLLLNQAL MRSNERGLQV
VLITGEIGVG KSRLWQEWAQ QLPADSTMLA MHCLESTQSL PFAPLTELFG QRICLSRLFS
GDSAVDPMWL AEVARLLPQL RGHIPNLPEP PVLPADEERR RMFEAFAQCL RAVMTNHLVI
GIDDLHWADQ ATAEWLDYVV DRLHDLPIVL VVTYRSEEAN ARLQRLVVGW QRGSLVTRID
VPRLDLSEAR ELLGALGHPN PDDQTLLERS AGNPLFLLEL CRTADPADVP PMLVDVLSAR
LARLPERAAQ IVQAAAVLDP LFGYSILRET GGRSDEETLD GLDGLLSAAI LKEHGDSYTF
SHPLVATVVR NSLSRARRSF LHRRAAQALQ HEYSDQRAIA GRLLLHYREA GEAKLAASYA
DQALEHALSL AAPNEAVAFG QQAVELDPTP ERYCRLGDAL EWNSEVPAAR EVYHTALAQY
QAQANWLRVV AVCTKLGRTY LVVGRPEVVI EWAERGLAIY HKHKIDDQLI EADLLLLLAI
SQRLAGYPLN VAYENIQAAL EVATAQQNHS LIGRCQFELG NILAQRGEIK AAVETFALAI
ASTAATNEQY QIILGYNNAA YNATLIGDLV TAHSHIQAGL HLAERLALRV PLQYLYSTRG
EIALAEKHWD EAEEWFERGI SVAQANGNAA QVANYRANLG LVARGRGDLD QALVLMRTAL
SEVELLTAPF LQTQINIWLA EIWQERHDYL AANAALQRAQ QAVTPEQGFL YQRVQQLQQQ
LGSMQPAMVH QRSI