Gene Haur_3841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3841 
Symbol 
ID5735706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4821508 
End bp4824930 
Gene Length3423 bp 
Protein Length1140 aa 
Translation table11 
GC content51% 
IMG OID641280994 
Producthypothetical protein 
Protein accessionYP_001546605 
Protein GI159900358 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0865863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGGA TGCTACAGTT TTTGGCAATC GTGATGCTGG TGATCGGTGG CACTTGGCCG 
CAATTTCAGG CCAACGCAGC TCAATCAGAG CCAAATTTGC AGGCTCCCAC GGCACTCAAT
GCGGTGGCCG ACGACCAATT TTGGAATAAC GATAATTTGG TGGCCGATGC CAATAATAAG
ATTAATGCGA TTACAACATT GAATAACGAT CTGTTTGTTG GTGGGTTGTT TAGCCGAGTC
GGCGGCATCG ATGCTGCACG AGTAGCCCGT TGGAATGGCG AACGTTGGTT TGCGCTAGGC
AGTGGTATCA GCGGCGCAAA TGCCAAAATC GAGGCGATCG ATGCCTCAAC CAGCGGCAAT
GTCTACGCTG TGGGCGAGTT CGCTAGTGCT GGCGGGGTCG CCGCTGATAG TATCGCCCGC
TGGAATATTG CCACCCAACA ATGGTCGGCG CTAGCAACCA ATGTTAATGG CTCGGTCGAT
GCTGTGGTGG TTGTGCCCGA TGCGAATGGC GATATTGTGT ATGTTGGCGG CAGCTTTACG
AGCATCGATG GGGTCAGCGC CAATCGAATT GCCCGCTGGA CGAATGGCAC ATGGAGCGCC
TTGGCTGATG GCCTAGCAGG CACAAATGCC CAAGTTTTTG CTGTAGCCTT CAATCCCAGC
AACACGGCCC AAATTTTGGC AGGCGGGGTA TTTACCAGCG CCAATGGTTT GGTTGCCAAC
AATATCGCCT TATGGGATGG CAATGCTTGG CAAAGCCTTG GTACTGGCAC GAACAATGGG
GTAAATGGTC GAGTGCGCTT TGTCGATTTT CGCAGCAGCA ATTTTGTGGT GGTTGGTGGT
AGTTTCACCA GTGCAGGCGT TGTCTTGCCA GTGGGCGGCT CGGCAGCTTG GCGCGGCGGC
AATACGTGGG AAGCCTTCAC TGGCCGCGGC TTTAATCCAA TTGCGTCAAA TTCTGAAATT
CGCGCAATTG CCGAATTACC TGGGCGAATC TTTGTTGGGG GCAATTATTT TGGTCTGGTG
AACTCCAGCG GTCAGGTAAC CACGGGGAAT TTAATGGGGC TATGGAACGG CACAACCTGG
CAGTTTATAC CAAATTCACC CTTCAAAATT AGCCAAATCG AACGGATTGG CACAACTGAA
CAATTTTTCG TTACTGGTGA AAGTACCAAT ATACTTAATC CGAGTGGTTT TGTTAGTTTG
TTTATTCCCA CGCCAACTGG CAGCAATCAG CCGCATCAAT TCTTGCCCTT GGGTGGCACA
ATCAGCGGGC CGGATAATGC AGTATTTGCG GTGCATGCCA GCGCTTCAGG CGTGTTTGTC
GGTGGCGAAT TCAACCGCGC CAGCGATCAG GCGATCAACA ATATTGCGCT GTTCAACCCA
ACGACCCGCA CATGGTCGCC ATTATTTGGC AAAACTGAAA ATGGCACCAA TGATACCGTG
CGGGCAATTG CGCCATTTGG CAGTGATCAA TTGATCGTTG CTGGCGATTT TAGCGAGGCT
GGCGGAATCA ACACCGCAGG CGTGGCCAAG TGGGATATTG GTGATCAGCA ATGGTTTGAG
ATTTCCAATA GTATCAATGG TTCGGTGCGG GCAATTGCCA TCAACGGTAG TGAAATTGTG
ATTGGCGGCG ATTTTACCCA AATCGATGGT ACGACTGTCA ATCATATTGC CCGCAGTACC
AACGGCGGAA ATTCATGGTC GGCGCTTGGC TCAGGCATTG GCGGGCCAGT TCATGCCTTG
GTGTTCCGCA GCGGAACGTT ATTTGCAGGC GGTGATTTTA GCACTGCTAG CGGCGTGAGC
GCCAATAATA TTGCCCGTTG GAATGGTACA GCCTGGCAAG CATTGCGCGG TGGCACTGAC
GATATTGTAT TTTCGTTAGC GGCATTTGGC ACCCAGATTG CGGTTGGTGG CTCGTTCAAT
TTAGCTGATG GCGTGGCTGG AACCAGCGCA ATCGCCCTTG TTCACCCGAC AACTGGGGTT
TGGTCGCCGT TGGCTCAAGG CTTTGGCATT GGCAACACTG TTCAAACGCT GGCAGTGCGT
GGCACTGATT TGTATGCAGC TGGTACGTTT TTCTACTCGC AGCCTAACCC CACTCGCCCA
ACCAATATCG CCCGTTGGGA TGGTACTGCT TGGCAAGCAT TGGGTAGCGG AATTAGTGGC
GGCACTGGCA ATGCCGATTC GGTCAAAGGA TTCGCAATGA GCGTGCGCGG CGATGATCTA
TTTGTTGGTG GAATTTTTGA TGCTGCTGGC AGCAAATCAT CGAAGCGCTT TGCCCAATGG
ACTCAGCCCG AAGTTGATTT GGCGGTCAGT CTGCGCGATT CGGCTGATCC GGTACAGGTC
AACACCCCAT TCCAATACAA TGTGAGCTTA ATCAACCAAG GCATCATTAC CGCAACCAGC
GTGATTTACG AGCAAACATT TGATAGCTCG GTTAGTTTTG GCAATATTAG TGCTAGCCAA
GGCACATGTA GTTTCAGCAA TGCCACGACC TTGCGTTGTA CGCTTGGCAC GCTCACGCCC
AATGCCCGCG CTACTGTGGT AGTCAATGCT ACGCCAACTC AAGTTAGAAC GATCACAAGC
CGTGGCACGC TTAGTTCGCC GGCTGATGAG CCTATTACAG CCAACAATCA ACAACAAATT
AGCACCCAAA TTATTGCTCC AGGCAACCCA GTGCCAACTA TCAGCAGCAT TACGCCCAAT
AGTTTTGTGC AACAAGCAAT TGGCTTGCCA GTGCAAATCA GCGTCAGCGG CACGGGTTTT
GTTGCTAGCT CCAAGGTGTT TGTTGATGGA ATTGAACGCC CAACTACCTT CTTAAATAAT
GTTCAAATTA ATTTCTCAAT GCCAGTCAAT ACAGCACTGG GTAATCATAG CGTGATTGTG
CGCAATCCCA CACCTGGCGG CGGCGATTCG AATAGCGTTA ATTTGGCAGT TTTGCGCAAT
AACGTCGGAT TTAGCAGCAT CACTCCCGAT ACAGGCGGTG TAGATGTAGG TTTGCAAACC
ACCTTCACGA TTAGCTGGAC TCACACCACC GATCCATGGC GGATTATCGA TAGCCTTGAT
TTACGCTTGG TCAATGCCGA TGGAATTGGG TTGTGGGCAC GCTTTACTGA GGGCGTTTCG
GGCACATTTA GTTTGCTCAA CAGCGAGGGT GATATCATCG GCACAGCGAT TGCCGAAACT
CCAAACATCC TCAGCAGCGA CACTGCCGAG CTTGATGTTG AAGCCAGCAG TTTTGCCGGC
AGCGGCCCAA CTGGCTTTAG CATGAATGTA ACCTTCAACG TGACCTTCAA CGAGGCGGCG
CGTGGTCGCT ATAATATCGA ACTCTACGCC AGCGATGATC ATGGCGAGTT GCAAGGGCCA
GATGTGCTGG GCACCTTCGA TGTTGGCATT CACCAAATCT TCCTGCCAAT GACAATTAAA
TAA
 
Protein sequence
MRRMLQFLAI VMLVIGGTWP QFQANAAQSE PNLQAPTALN AVADDQFWNN DNLVADANNK 
INAITTLNND LFVGGLFSRV GGIDAARVAR WNGERWFALG SGISGANAKI EAIDASTSGN
VYAVGEFASA GGVAADSIAR WNIATQQWSA LATNVNGSVD AVVVVPDANG DIVYVGGSFT
SIDGVSANRI ARWTNGTWSA LADGLAGTNA QVFAVAFNPS NTAQILAGGV FTSANGLVAN
NIALWDGNAW QSLGTGTNNG VNGRVRFVDF RSSNFVVVGG SFTSAGVVLP VGGSAAWRGG
NTWEAFTGRG FNPIASNSEI RAIAELPGRI FVGGNYFGLV NSSGQVTTGN LMGLWNGTTW
QFIPNSPFKI SQIERIGTTE QFFVTGESTN ILNPSGFVSL FIPTPTGSNQ PHQFLPLGGT
ISGPDNAVFA VHASASGVFV GGEFNRASDQ AINNIALFNP TTRTWSPLFG KTENGTNDTV
RAIAPFGSDQ LIVAGDFSEA GGINTAGVAK WDIGDQQWFE ISNSINGSVR AIAINGSEIV
IGGDFTQIDG TTVNHIARST NGGNSWSALG SGIGGPVHAL VFRSGTLFAG GDFSTASGVS
ANNIARWNGT AWQALRGGTD DIVFSLAAFG TQIAVGGSFN LADGVAGTSA IALVHPTTGV
WSPLAQGFGI GNTVQTLAVR GTDLYAAGTF FYSQPNPTRP TNIARWDGTA WQALGSGISG
GTGNADSVKG FAMSVRGDDL FVGGIFDAAG SKSSKRFAQW TQPEVDLAVS LRDSADPVQV
NTPFQYNVSL INQGIITATS VIYEQTFDSS VSFGNISASQ GTCSFSNATT LRCTLGTLTP
NARATVVVNA TPTQVRTITS RGTLSSPADE PITANNQQQI STQIIAPGNP VPTISSITPN
SFVQQAIGLP VQISVSGTGF VASSKVFVDG IERPTTFLNN VQINFSMPVN TALGNHSVIV
RNPTPGGGDS NSVNLAVLRN NVGFSSITPD TGGVDVGLQT TFTISWTHTT DPWRIIDSLD
LRLVNADGIG LWARFTEGVS GTFSLLNSEG DIIGTAIAET PNILSSDTAE LDVEASSFAG
SGPTGFSMNV TFNVTFNEAA RGRYNIELYA SDDHGELQGP DVLGTFDVGI HQIFLPMTIK