Gene Haur_0175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0175 
Symbol 
ID5732084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp204764 
End bp206662 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content51% 
IMG OID641277299 
Producthypothetical protein 
Protein accessionYP_001542955 
Protein GI159896708 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000546306 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTC AACGAATGAT TGGGCTGGGT GGCCTGCTTG GGCTAGCACT ATTGATCTTA 
ATCCTGCCAA ACAGCCGTTT TTGGTTTGCG GCGCTGGTTT GTGCGTTGGC TCCAGGCTAC
GTGCTCGAGC GCTGGCTGGA TTTGGATTTA GCGCCATTGG TGCGACCAAG CCTATGGATT
GGGCTAAGCC TGGCAGTTTG GCCGTTGGGC TATTTATGGC TGACCACGCT TGGTTTATCG
CTGAGCACTG GTATGATCAC GTTGATTGCC TTTGGCCTGC TGGCTGGCGT AGGTTGGCGT
TTATGGCGCG AGGGCGAGCG ACCTTGGGCC TTGCCAGCGC CAGTGCCAAT TCTTGGATTG
GCGTTATTAA TTGTGACCTT CGCGATTAGC ACTAGAATTA GCCATATTCG CAACGTAGCT
TTTCCTCCGT GGGTCGATTC GCTGCACCAT GCCACGATTA TGCGAGTGAT TGCTGAAAGC
GGCCAAGTGC CCTACTCGTT GCGCCCCTAC ATGCCAGTTG ATAATTTTGG CTATCACTGG
GGCTTTCACG CCACGGCAGC CACGATCTAC AATCTGAGTG GCATGAGCAT TCCCCAATTT
ATGCTGTGGT ATGGTCAATT CTTGGGTGTG TTGGTGGTGA TTTCGGTTGG CAGCGCGACG
ATTGGCCTAA CCAAAAGCCC GATTGCTGGC CTAGCCGCCG CCACCATGAC GGGCTTTATC
TCGATTATGC CCGCTTATTA CCTGAGTTGG GGCCGTTACA CCTTGCTTTC GGGTTTGGCG
ATGGTTCCAG TGGTGTTGCT GTTGGCGTGG GTTGCGCTTG ATCGACCTGA TCGCAAAGGC
CTTATTTTGC TAACGCTCGT GGTTGGTGGG CTACTGCCAA CTCACTTTGT GGCGGCTGGC
TTTGCGTTGT TATGGTGTGT CGCAGTTTGG TTGGGCCGCG ATGTTTGGAC CGAGCAGCGT
TGGCAAATCT TGGGCAAGCA AGCGGCATCA GTCGGCATGG CGATTTTGTT GATGTCGCCA
TGGCTGGCCC TATTGATTCG TGAAATTCAG CCTGCTGGCA GTGGCACACC CAAGCAATTG
ATTGGCGGCG GCTACAACAC CTATGAAGCT GCCAAAGGCT TGTATTGGAC GTGGAATAAC
CTTTTGCTCT TCTTAGTAGG TTTGTTGGCG GCTTGGATTG GCTTGTTTCA ACACTGGCGT
TTAGTGTTGA TCAGCTTTTT ATGGGCTAGC CTTGTCATGC TGTTTGCTAA TCCAGTGGTA
ATTGGCTTGC CCTACCTCTC GTTTTTCAAC AACAACATTG TGGCCTTAGC AATCTTTTTG
CCGATTAGTT TGTGGTTTGG CTTTGGGGTT GCTTCATTAG ACCAAGGCTT GAGCAAACAT
CTCAAACAGG GAGTAGCCCG AGGTTGGCGG GCGATTCGCA CCGCAATTTT GGCAATAACC
GTGCTGATTT CGGCTACCAA AATGCACAGC GTAATCAACG ATGGTACGAT TATCGCCAAA
GCTGATGATT TAACTGCCCT GAATTGGATT GTGCAGCGCA TTCCCAAAAA TGCGCGGTTT
GCGATTAATA CCGAAGGTTG GTTGTATAAC GTGGCCCGTG GCAGCGATGG TGGCTGGTGG
ATTTTGCCCT ATGCTGGCTT GCAAGTGAGC ACACCGCCAG TTGTCTACAA CCAAGGTACA
GCTGAGTATA TTGCGGCGGT TGAGGCTGAA ACCAGTTGGT TGCGCAATGC CAACGAAAAA
AGTGCTGCTG AATTGGCTCA GTGGATGCGT GAACATAACT ATGATTACGC CTATGCTACC
ACCAATGGCA AAATCTTCAA TCAAGCCAAA TTAGCCAATA CAGCTGAATT TGAGCTGGTC
TATGAAAATG CAAGTGTGGC GATTTATCTG CGGAGATAG
 
Protein sequence
MNRQRMIGLG GLLGLALLIL ILPNSRFWFA ALVCALAPGY VLERWLDLDL APLVRPSLWI 
GLSLAVWPLG YLWLTTLGLS LSTGMITLIA FGLLAGVGWR LWREGERPWA LPAPVPILGL
ALLIVTFAIS TRISHIRNVA FPPWVDSLHH ATIMRVIAES GQVPYSLRPY MPVDNFGYHW
GFHATAATIY NLSGMSIPQF MLWYGQFLGV LVVISVGSAT IGLTKSPIAG LAAATMTGFI
SIMPAYYLSW GRYTLLSGLA MVPVVLLLAW VALDRPDRKG LILLTLVVGG LLPTHFVAAG
FALLWCVAVW LGRDVWTEQR WQILGKQAAS VGMAILLMSP WLALLIREIQ PAGSGTPKQL
IGGGYNTYEA AKGLYWTWNN LLLFLVGLLA AWIGLFQHWR LVLISFLWAS LVMLFANPVV
IGLPYLSFFN NNIVALAIFL PISLWFGFGV ASLDQGLSKH LKQGVARGWR AIRTAILAIT
VLISATKMHS VINDGTIIAK ADDLTALNWI VQRIPKNARF AINTEGWLYN VARGSDGGWW
ILPYAGLQVS TPPVVYNQGT AEYIAAVEAE TSWLRNANEK SAAELAQWMR EHNYDYAYAT
TNGKIFNQAK LANTAEFELV YENASVAIYL RR