Gene Information |
Locus tag | Haur_3007 |
Symbol | |
ID | 5734894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3797709 |
End bp | 3799439 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280151 |
Product | hypothetical protein |
Protein accession | YP_001545773 |
Protein GI | 159899526 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
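The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = cds_lengths(3797709, 3799439)
print(gene_len, protein_len)  # 1731 576
```

The result agrees with the Gene Length (1731 bp) and Protein Length (576 aa) fields.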
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGGA TTGTGATCCT GCTGGGCTTG ATTTTGAGCC TGATTCTCAT CCCCTCGAAT TCGCCCTCAA CCAGTGCCGC CGAGCCTTCG CGTCCGGTTT TTGCCTACTA CTATGGCTGG TGGGGCAGCC AAACTTGGTA TCTCGACAAG ATTCGCGACC GCCCGCTGGA ACTCTACGAG AGCGACCGCG ATAGCACCAT GCTCAACCAT ATTCGCCAAG CCAAAAATGC TGGCATCGAT GGTTTTATCT GTACATGGCG CTATACCTGT GCCCGCTTGT TGCAACTGGC TGAGCAAGAA GGCAATTTCA ACGTGGTTTT CAGCGTTGAT CCGGTAGCTG ATGGCACGCT CAACTCAACT CAGGCAATTG TCGATAATAT GCGCGAAATG GCTGGCCTCG CCAGCAGCAA TGCCTATTGG CGCTGGGATG GCAAGCCAGT TTTCGTGTTT TGGAATGATA CAATTTTGCC TGGTGGCCGT GGTTCGCTCA GCGATTGGAC CGATCTGCGC AGTCGGGTTG ATCCCAATCG CAATCAATTT TGGCTCGGGG GCGGGGTAAA CTTCAGTCTG CTCGATGTGT TCGATGCGAT TCACTTTTTC GATATTACCT GGGAACGCAA GCAAGGCGAT GCGATGATTT CGTATAGCCG CAATTTGCGC GAATATAACA GCAGCCGCAA TAGTAACAAA CCATTTGTCG CGACTGTTAT GCCTGGTTAT GATGATTTGC TCTATCGCAA CGGCCACTTC CGCGACCGCG AAAACGGCAA CTACTATCGC GCTGGCTGGG ATACGGCGAT CAATTATCAG CCCAAGGCGA TCATTTTAAC GAGTTGGAAT GAATGGTATG AAGGCAGCCA GCTTGAGCCA AGCCAAGCAT ATGGCAACCT CTACCTCGAT ATTTCACGCG AAAAAATTAG TGCCTTCAAA AATAATGTGC CGCCAATTCC CGATGGCTTT GCCGATACCA ACTTTGAAAA AACCTGGCAA CGCACTGATA AGCCCGTCCA AGATGGCCGC ACCAGCCGCT CATGGGTTTG GGGGCCAGCA ATCGCCAAAG GCCAGTACGA GCCATATGGT GGCAGCACGC GGCTCGTCCA ATATTTCGAT AAATCACGTA TGGAAGTCAA TAATCCCAAT GGCGACCGCA ACCAGCCATG GTTCGTTACC AACGGATTAT TGGTGGTCGA GATGATTCGC GGCCAAGTCC AAATTGGCGA TAGCTCGTTT GAGCAACGTA TCCCTGCTGA TGAAGCTTTG GGCGGCGATC CACGGGCAAT TAACGATGTT GCGCCTGGCT ATAGCTCACT GCGCAATGTA TTGGCCAGCC AGCCCGATAA AACTGGCCAA ACTATCAGCA CCCAACTCAA GCGCGATGCG ACGACCAGTG GTATTACTGC GCCAAGTGTT GTCACCAACG CCCATTATGT CAGCCAAACC CAGCATAATA TTCCCGATGT CTTCTGGAAT TTTATGAACC AAACGGGCAC GGTTTATGAA AATGGCCGCT TTGTCAACGA TAAACAAGTT GTTGATTGGG TGTTTGCTTT CGGCTACCCA CTAAGCGATG CCTACTGGAT GCGGGCCAAA CTTGGTGGCA CAGAAACATG GATGTTAGTC CAAGCCTTTG AACGGCGGAT ATTGACCTAT ACCCCAACCA ATGCACCTGC CTTTCAGGTT GAGATGGGCA ATGTAGGCCA ACATTATCGC CGCTGGCGCT ACGGCAACTA G
|
Protein sequence | MRRIVILLGL ILSLILIPSN SPSTSAAEPS RPVFAYYYGW WGSQTWYLDK IRDRPLELYE SDRDSTMLNH IRQAKNAGID GFICTWRYTC ARLLQLAEQE GNFNVVFSVD PVADGTLNST QAIVDNMREM AGLASSNAYW RWDGKPVFVF WNDTILPGGR GSLSDWTDLR SRVDPNRNQF WLGGGVNFSL LDVFDAIHFF DITWERKQGD AMISYSRNLR EYNSSRNSNK PFVATVMPGY DDLLYRNGHF RDRENGNYYR AGWDTAINYQ PKAIILTSWN EWYEGSQLEP SQAYGNLYLD ISREKISAFK NNVPPIPDGF ADTNFEKTWQ RTDKPVQDGR TSRSWVWGPA IAKGQYEPYG GSTRLVQYFD KSRMEVNNPN GDRNQPWFVT NGLLVVEMIR GQVQIGDSSF EQRIPADEAL GGDPRAINDV APGYSSLRNV LASQPDKTGQ TISTQLKRDA TTSGITAPSV VTNAHYVSQT QHNIPDVFWN FMNQTGTVYE NGRFVNDKQV VDWVFAFGYP LSDAYWMRAK LGGTETWMLV QAFERRILTY TPTNAPAFQV EMGNVGQHYR RWRYGN
|
| |
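The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is one way to check both, under stated assumptions: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is built from the compact amino-acid string that NCBI uses for table 11 (bases ordered T, C, A, G), and translation simply stops at the first stop codon. Table 11 also allows alternative start codons that are read as M in the first position; this record starts with ATG, so the sketch does not handle that case.

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence in this record.
print(translate("ATGCGACGGA TTGTGATCCT"))  # MRRIVI
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field checks the whole record; `gc_fraction` on the same input can be compared against the reported GC content.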