Gene Information |
Locus tag | Haur_2568 |
Symbol | |
ID | 5734446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3292299 |
End bp | 3293564 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279708 |
Product | hypothetical protein |
Protein accession | YP_001545334 |
Protein GI | 159899087 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
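The start, end, gene length, and protein length fields above are mutually consistent. The following is a minimal sketch of that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Haur_2568 record above.
length = gene_length_bp(3292299, 3293564)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Both results match the Gene Length (1266 bp) and Protein Length (421 aa) fields listed in the record.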
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.605817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACCA CAGTTATAGG TGTATTCAAT TCAGACGCAC AAGCTCAAGC CGCTGTTCGC GATTTACAAG CACTTGGTAT CACCAATGAT GCTATTTCAG TTGTTGCTCG CGATACTAGC CGCGCCGTTG ATGCCGATGG TAACTTGGTC ACCGTCAGTG ACGATCATAT GACTGCTGGC GAAGGCTTAA CTGTTGGAGC AGTTTGGGGC GGGTTGGTTG GCTTGGCCGC TTTGGCAATT CCTGGCATCG GGCCATTGAT TGGTGCTGGC GCATTGGTAT CGGCCTTGAC CGGCGCAGTT GCAGGCGCAG CCACTGGCGG GATTGCCGGC GCATTGATCA ACGCCGCTAG CGTTCCCGAA GATCAAGCGA ATGTCTACGA AGATCGGGTC AAAGCTGGGA GCACCTTGGT CACGGTGCAC GCCAACGATC ACATGGCGGC CCAAGTAAGA ACTACCTTGC GTCAAGCAGG AGCCGAACGC TTCCAGTGGG ATAGCGATAC CGATTACGAC CAAAGCAATG AGCAAGCCTA TGCCGATAGC AGCAAAGTTG GCACAGTTGG CGGTGGTGCT GCTGGCGCGG TAACTGGTGC AAGCATCGGT GCTGCTGGTG GCCCAGTTGG CGCGGTGATC GGCGGGGTAA CTGGCGCTGT GGTTGGTGGA GCAATTGGCG CTGCTGGCGA TACCACAGGC GAAAAAATGA ATGATCGCAC GCATGATAGC GATTATCCGC ATAGCACCGC CTATGATACG ACCAATGCCT ACTCTAGCAA TGCTACCTAT GATCAAATGA CTACCGTCGA TACAACTCGC GAAGCGATTG CTAATCGCGA TTACAGCACC ACCAACGATA CGCCTGTGAG CAATCGGATT GAAAACGCTG TGCGCGATGT GACCACCGAT GATCAAGGCC ATGATACCTT CCGTGCTGCC GACCAAGCCT ACGATAACAG TAGCAAAGTT GGCACTGCTG GTGGTGGCGC TGCTGGTGCA GTAACTGGCG CAGCCATCGG TGCTGTTGGT GGCCCAGTCG GCGCAGTCGT TGGTGGCGTG GTTGGTGGAG TCACTGGCGC AGCAATTGGC GCAGTTGGTG ATACCGTTGG AGAACAAGCT GATACCGAAA CAGGTGCATT CAGCCATGAT AATCAAGCAC AATATCGAGG CACCGCCAAC GCAGTCGGTG ATAAGTTACG CGATGCTGGC AATAGCGTCG AACGCAAACT TGACCGTGAT ATCGACCGCG ATGGCGATGT AGGTCGCCGG GGCTAA
|
Protein sequence | MTTTVIGVFN SDAQAQAAVR DLQALGITND AISVVARDTS RAVDADGNLV TVSDDHMTAG EGLTVGAVWG GLVGLAALAI PGIGPLIGAG ALVSALTGAV AGAATGGIAG ALINAASVPE DQANVYEDRV KAGSTLVTVH ANDHMAAQVR TTLRQAGAER FQWDSDTDYD QSNEQAYADS SKVGTVGGGA AGAVTGASIG AAGGPVGAVI GGVTGAVVGG AIGAAGDTTG EKMNDRTHDS DYPHSTAYDT TNAYSSNATY DQMTTVDTTR EAIANRDYST TNDTPVSNRI ENAVRDVTTD DQGHDTFRAA DQAYDNSSKV GTAGGGAAGA VTGAAIGAVG GPVGAVVGGV VGGVTGAAIG AVGDTVGEQA DTETGAFSHD NQAQYRGTAN AVGDKLRDAG NSVERKLDRD IDRDGDVGRR G
|
| |
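The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and the example string is only the first three blocks of the gene sequence, so its GC fraction is not expected to equal the 55% reported for the whole gene.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace between display blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    return (cleaned.count("G") + cleaned.count("C")) / len(cleaned)


# First three display blocks of the gene sequence above (a prefix only).
first_blocks = "ATGACAACCA CAGTTATAGG TGTATTCAAT"
cleaned = strip_blocks(first_blocks)
print(len(cleaned))
print(round(gc_fraction(first_blocks), 3))
```

Applying `gc_fraction` to the full gene sequence, and checking that `len(strip_blocks(...))` equals the Gene Length field, would be a reasonable sanity check on a copied record.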