Gene Haur_5030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5030 
Symbol 
ID5736989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp40715 
End bp43885 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content58% 
IMG OID641282197 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001547788 
Protein GI159901542 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.311949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCGAC AATTACCCAT ACCCTATTTG CCGCCGGACA CTTCTGCCGC CGCCGACTTT 
CGGGTGTACG TGCTTGGCAC TCCCGTCTTG CTTTGGGCCA ATACTCCCTT CTCGATTGCT
CGGCGCCAAG CCCGCGCGCT GCTGTATCGT TTGGCCTCAG ATCTGAACCC AGTGGCACGC
ACCGAGCTAG TTTACCTTTT CTGGCCCGAC ATGCCCGATC TGACCGCACG CCGCCATCTG
ACCCATCTGC TGACCCGTCT TCGCCAGGAC ATCCCCGATC CTCGCATACT CATCGCCACA
CCTGATCATG TTACCCTTGA TCCTGAACGC GTCTGGATTG ATAGCTCCGT TTTCATGCAT
GCGGTTGCCA CCACTGATCC TGAGCACCGA CTGGAAGCCC TTGATCATGC AGTAACCCTG
GTACGCGGCC CCTTTCTCCA TGGGGTGGCC CTCGCCGATG CCCCGGAATT TGAGCTGTGG
CTGGCCCAGG AACGCAGCAA CTGGGAGAAC CGCACGTTGG CTGTGCTTGA CACCCTACTT
GAGCAGGCCA CTGTGGTACG CAACTATCCG CTGGCTATCC GTACTGCCCA ACAATCCCTA
GCCTTGAACC CGTTGGCCGA AGATGTCCAT CGACACCTGA TCGGCCTATA TGCAACCATC
GGTGATCGCG GTGCTGCCGT GCGGCACTTT GAGCACTGCC GCACCTTACT TGAGCAGGAG
CTTGGTGTCA CGCCTCTACC CGAAACGTTG GCCATCTACG AGCAGGTTCG TGCGGGCCAG
AGTCCCTTTC CCTCTGAAAC GGCGGGTCAA CGAGGCCTGA TCCTAGCGGG AAACGCGGTG
ACTCACCCAG TCGATTTCCC TGCTCGTCCC GAATCAGTGA TCAACCCCGA TTCGGCGGAC
GAGGCCCCCG TCTGGGCACC TGATGTCACT GAAAAGGGCC AGATGTTGGT TACTAAACCA
TTGTATGGCC GCGATGCCGA GGTAGCGATC ATAACAACCA TGCTCGCGAG TGCGATGCCT
CGCCTGATCA CACTGAACGG TCCCGGCGGC AGTGGCAAAA CCCACCTCGC ACAGCAAATC
GCCGCCAGCG TCGATTTCCC GGATGGGGTG GTCTGGGTCG CGCTTGGATC TCTGCGCGTA
CCGGGATTAC TTCGCGATGC CATCGCCTAC GCCTGCGGGG TTCGCACCAC TGGCTGGAGC
GCAGCCAGTG CGGCTGTGGC AGATCAGCTA CATACAGCGC TCCAGCCCAA ACACATGCTT
TTGGTGCTCG ACAACGCTGA ACATCTCCTA GACGGGACAG GCGTGATCGC GGAGTTGTTG
GTGGCAGCAC CAAACCTACG GGTTCTGGTG ACCAGCAGGG TCGCATTAAA TTTGCCAGGG
GAGCAACTAG TTCCTGTGCC GCCACTGCCC GTCCCATCCC TAGCCACATT GCCACCGATC
GAGCAACTGG CGCTCCAGCC AGCCGTAGCC CTGCTGATTC ATCGCGTACG CGAACGCCAG
CCTTGGTTCA TGTTAAGTGA GGAAAATGCA GCAGATATCG CCGCCATCTG CGTGCGCTTG
GATGGACTGC CGCTGGCCCT TGAACTGGCC TCCACCCGGC TTATCACCCT TACACCAGGT
GCCGTGCTTG CACGACTGAA CCATCGCCTG ACGTTGCTCA CGCGCGGGCC ACACATTCTT
CCAGAGCGCC AGCAAACCTT ACGGGCCACG ATTGACTGGA GCCATCGATT GCTTGATCTG
TCGGCGCAGG CGATGTTTGC CAACCTAGCA GTCTTCGCCG GTGGGTGGTC ATTGGCTGCT
GCCGCAGCCG TGATGCAGCA CCAGATGCCA GCCGCTGCGC CGGTGCCTGA CGAGATCGCC
GTGTTAGATC TGATGCATGG ATTGCTTGAG CACAACATGA TTTTTTCAAT TCCGGGGGAG
GAGCCGCGTT TTGACATGTT GGATACCCTG CGCGAGTACG CCCAGGAGCA ACTGAAAGCA
CGCGGTGGTG CTGGCACGGC GGACGAGGCG CATGCCACGT TTTATCGTGA GCTCGCGCTT
CGTGCCACAC TCCACATCCA GGGCGAACAG AGTACGGCCT GGCTGACTAT GCTTGCACTG
GATCATGATA ATTTGCGCGT GGCACTGGCA TGGTTTCTCG GGCAACCCGA TGGCGGTGCC
GGCGCTATCG ATATCACTAA CATATTGCAA GTGCTTTGGC GCTGGCGCGG TGAGTACCAT
GAAGCCCGCC ATTGGATACC ACAGGTGCTT GATCACAGTC GGGGATTGGC ATCGGAGAAA
CATATCCATC TGCTCATGAT TGCTGGGAGT GCTGCTGCGT TTTATGGTGA TCGGGATACG
GCGCTGGACT GGTTCACTGA GGGTCTTGAT CTGTGCCGCG ACGTTGCAGC TCCGCTCATT
GAGTCAGATT TGCTGATCAA CATGGGGCGA ATCTATTGCT ATCGTGGAGA TTTTGTGCAC
GGCTGTGCAC TCTCTGAAGC AGCCCTCGTC CTCAGTCGTA AGACAGAAGA CCCACTGCAT
CTCGTCAGGA CGATTCGCAA CCTTGCAGCC GCGCTCTGCG GAGGATTGGT CAATATTGAA
CGTGGCATGG CCCTGTTTGA AGAAGGATTA ATCATTGCAC GTGCAACAGT CGATAGCGGT
GCCTACAGTC AGGTTACCCT AGCGATCTTC CTCATTGATT TTGGATCTTA TCTGGCGTTG
ACAAACCAGC ACACACGCGC GGCGATGCTC CTCGCCGAAG CCTTGGCGTT GGCCGAGCAG
AACGATCATG CTATCGCCAG AGTCAACGCC CTGGGAAGTT TGGGGTTTCT TGCGTTGCTG
CAGAGTGACC GCGCAACAGC GTGCCGCTAC TTTCTGGACA GCCAGCAGCT CATCAATGGA
ATCTCGGCAC CCATGAATAC AGTCATGAAT ATTGAAGGAC TCGCCGAAGT GGCTGCGGAA
AAACATCCGC GCCTGGCTAT TCAGCTGCTG GCCGCAACCA GCGCGGCACG TACGGCGCTG
GATATGCAGA TCGAACCCAT CGAACAGCGG CATCGTGCCC AAGTCCTCGC TGACTTGCGA
CACGAGCTTG GCGACGAAGA GTTCTCGATA GCATGGTATC AGGGTCAGGA CTGGACTGTA
GAGCAGGCCC TAGCAGCAGC ACGCACTGAT ACTGGAGAAA AACAAGGGTA A
 
Protein sequence
MLRQLPIPYL PPDTSAAADF RVYVLGTPVL LWANTPFSIA RRQARALLYR LASDLNPVAR 
TELVYLFWPD MPDLTARRHL THLLTRLRQD IPDPRILIAT PDHVTLDPER VWIDSSVFMH
AVATTDPEHR LEALDHAVTL VRGPFLHGVA LADAPEFELW LAQERSNWEN RTLAVLDTLL
EQATVVRNYP LAIRTAQQSL ALNPLAEDVH RHLIGLYATI GDRGAAVRHF EHCRTLLEQE
LGVTPLPETL AIYEQVRAGQ SPFPSETAGQ RGLILAGNAV THPVDFPARP ESVINPDSAD
EAPVWAPDVT EKGQMLVTKP LYGRDAEVAI ITTMLASAMP RLITLNGPGG SGKTHLAQQI
AASVDFPDGV VWVALGSLRV PGLLRDAIAY ACGVRTTGWS AASAAVADQL HTALQPKHML
LVLDNAEHLL DGTGVIAELL VAAPNLRVLV TSRVALNLPG EQLVPVPPLP VPSLATLPPI
EQLALQPAVA LLIHRVRERQ PWFMLSEENA ADIAAICVRL DGLPLALELA STRLITLTPG
AVLARLNHRL TLLTRGPHIL PERQQTLRAT IDWSHRLLDL SAQAMFANLA VFAGGWSLAA
AAAVMQHQMP AAAPVPDEIA VLDLMHGLLE HNMIFSIPGE EPRFDMLDTL REYAQEQLKA
RGGAGTADEA HATFYRELAL RATLHIQGEQ STAWLTMLAL DHDNLRVALA WFLGQPDGGA
GAIDITNILQ VLWRWRGEYH EARHWIPQVL DHSRGLASEK HIHLLMIAGS AAAFYGDRDT
ALDWFTEGLD LCRDVAAPLI ESDLLINMGR IYCYRGDFVH GCALSEAALV LSRKTEDPLH
LVRTIRNLAA ALCGGLVNIE RGMALFEEGL IIARATVDSG AYSQVTLAIF LIDFGSYLAL
TNQHTRAAML LAEALALAEQ NDHAIARVNA LGSLGFLALL QSDRATACRY FLDSQQLING
ISAPMNTVMN IEGLAEVAAE KHPRLAIQLL AATSAARTAL DMQIEPIEQR HRAQVLADLR
HELGDEEFSI AWYQGQDWTV EQALAAARTD TGEKQG