Gene Information |
Locus tag | Haur_3557 |
Symbol | |
ID | 5735416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4473017 |
End bp | 4474930 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280704 |
Product | hypothetical protein |
Protein accession | YP_001546321 |
Protein GI | 159900074 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAC TTCCACTGCG ACGATTGATT TTGTACAAAC ATGGCGTTGG CTATTTCGAG CGGCGCGGTC AGTTCGAGGG CACGAAACTG GCGTTGGCCT TCAGTCGCGA AGCCATGGAC GATGTGCTGA AAAGTTTGGT GGTGTTGGAA AAAGGCGAGG GTCATGTCAA GGGCATCGAT TACGAACGAC CTGAATCATG GAGCAGCCGA CGAATTGAGC TTGGCGCTGG CCGCCCACTG CGCGATCTCT TAACCGAATT ACGCGGGCGA CGAATTCGCC TAGCCTTGCG CGATGAAGCT GCTGCCGAAG GCTTGATCGT CGGCATCGAC GAGCCACCAC CAGAAGAGCC AATTTCGCAA TCGCTGGTGA CAATTTATCG CAGCGATGTG CGCCAAATCA CCTTGCATCG TTTAAATGAT ATTCGCGGCG TGCAATTGCT TGATTCGGCT GCCGACGAAG TGCTGTGGTC GTTACGCGCC AACAGCGATA AACAAGATTC GCGCACAGCC ACGATTCAGC TCGACGAGGG CAAGCACGAT CTTTTGGTGG CCTACCTTGC GCCAGCCCCC AGTTGGAGGG TTTCCTATCG GATTATTGTC GATAATATCG AAACCGAATT GCCCGATGTG CTACTCCAAG GCTGGGGTTT ATTCGATAAT GTGCTTGATG AAGATTTAGA AGATGTGCGG ATTTCGTTGG TTGCAGGCCG TCCAGTTTCG TTCCGCTACC CGCTCTACGA GCCGCAACAA CCGGAACGCC CGTTGATTGA AGACACAATT CGGCCTGCCG CACCACCAGC CTATACCTTG GCTGCTCCGG CTCCTGCTAG CGCACCACGA CCAAAAATGA TGCGAGCCAT GGCTATGCGT GAAGAAGCAT TTGCGGGAGC TGATCTTGAT AGCCTGAGCA TGGATGAAAT GGCATCAATT GCACCCGTTG CCGAAGGTGC TGCCCAAGGT GCGTTATTCC GCTACGATGT GCGCGAACCA GTCAGCGTTG GCCGTGGTCG TTCAGCTTTA GTACCATTGC TCAATTTACG CACCGCCTGT CGGCGCGAGT TGGTCTATCG CGGCGCGGCG GGCGAGACCC ATCCCATGGT CACGGTACGC TTTGAAAATA GCAGCGGCCT AACCTTAGAG CGCGGCCCAA TTACAATTAT GGAAAGTCGC TCATATGGCG GTGAAGCAGT CTTGAACTGG ACAGCCGAAG GGGCAATGGT CACAATTCGC TATGCCCAAG CCTTGGAAGT TTCAGTTAAA GAAAATCAAA ACTCGGCGCA TCAGACCCGG CGTATCCGGC TTGGTCGCGA TGTGTTAATT CACGAAGTTG AAGAATCGCT AACCACGATT TACACCGCAA CGAATACTGC TGGCGAAGCC CGCGTGATCA AAATTGAGCA TCCTTTGCGT CACCCCTACG AGCTGTTCGA TACCGTCCAA CCAGCCGAAG CCAATAGCCA ATTGGCCAGT TGGTTATTGC CAATTCCAGC GCGAGGCGAA GCGGCCTTGC GCGTGCGCGA ACGACGCTTG GTTGAACGGC GCGAACACAT TCGCTCGATC AATTTCGAGC AATTGCGCCG CTATTTGATG GATCGCATGC TCGATCAAAA TACGGTCAGC GAGTTGCGCG AAGTGCTCAC GCTGTATGCA CGAATTGATA GCATCGTTCA ACGCTACAAC GAAATTGAAG CGTTACGTCA GAAGATCTAC AATCAACAAA CTCAAATTCG TGGCAACCTC GATGTACTCA AAAACGAAGG CGGCGAGGGT GAACTCCGCA CCCGCTATAT CACCACCCTA 
GCCGAAACCG AGGATCAACT CAACGGCCTG CGTGAGGAAG AAGTGACATT GCGCAGTGAA GAAGCCGCCT GCCACGCCAA ACTCGAAGAG CATCTCAGCC GCTTTCCGGG CTAG
|
Protein sequence | MTELPLRRLI LYKHGVGYFE RRGQFEGTKL ALAFSREAMD DVLKSLVVLE KGEGHVKGID YERPESWSSR RIELGAGRPL RDLLTELRGR RIRLALRDEA AAEGLIVGID EPPPEEPISQ SLVTIYRSDV RQITLHRLND IRGVQLLDSA ADEVLWSLRA NSDKQDSRTA TIQLDEGKHD LLVAYLAPAP SWRVSYRIIV DNIETELPDV LLQGWGLFDN VLDEDLEDVR ISLVAGRPVS FRYPLYEPQQ PERPLIEDTI RPAAPPAYTL AAPAPASAPR PKMMRAMAMR EEAFAGADLD SLSMDEMASI APVAEGAAQG ALFRYDVREP VSVGRGRSAL VPLLNLRTAC RRELVYRGAA GETHPMVTVR FENSSGLTLE RGPITIMESR SYGGEAVLNW TAEGAMVTIR YAQALEVSVK ENQNSAHQTR RIRLGRDVLI HEVEESLTTI YTATNTAGEA RVIKIEHPLR HPYELFDTVQ PAEANSQLAS WLLPIPARGE AALRVRERRL VERREHIRSI NFEQLRRYLM DRMLDQNTVS ELREVLTLYA RIDSIVQRYN EIEALRQKIY NQQTQIRGNL DVLKNEGGEG ELRTRYITTL AETEDQLNGL REEEVTLRSE EAACHAKLEE HLSRFPG
|
| |
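The length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The helper names (`span_length`, `expected_protein_length`, `gc_percent`) are hypothetical.

```python
# Values copied from the Gene Information table in this record.
START_BP = 4473017
END_BP = 4474930
GENE_LENGTH_BP = 1914
PROTEIN_LENGTH_AA = 637


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon.

    Assumes an unspliced CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

`gc_percent` accepts the gene sequence as printed here, in space-separated blocks of 10 bases, so the result can be compared with the GC content field.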