Gene Information |
Locus tag | Haur_4585 |
Symbol | |
ID | 5736430 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5866108 |
End bp | 5867619 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641281747 |
Product | FG-GAP repeat-containing protein |
Protein accession | YP_001547344 |
Protein GI | 159901097 |
COG category | |
COG ID | |
TIGRFAM ID | |
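
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5866108
end_bp = 5867619

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 1512
print(remainder)       # 0
print(protein_length)  # 503
```

Both results agree with the reported Gene Length (1512 bp) and Protein Length (503 aa).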
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000887674 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAATTGC GTCAATTACT TGGGTTATGC GGCTGTGGCA TTGCCCTTGG ATTAAGTGGC TGGGCTTTTG CTGAATCCCC GCAATTCAGC GAAAGTCTCG CCGAAACAAC AATTGTGGCG TTTACGGGCG GTGCTGGCGA ACACGCTGGT TATGCCGTGG CCGACGCTGG CGATATTAAT CAAGATGGCT TGGCCGATAG CTTGGTCGGT GCTTGGGTCG CTGATCCGTT GGGTCGCAAC AACGCTGGCA CCAGCTATCT AATTTGGGGT CAAGCGCTTA CCGCAACCTT GCAAGCCCCC GATCTGGCCG AACGGGGCGT GGCAATTTAT GGGGCCAGTG AGGGTGCTTC ATCTGGCTGG AGCGTCACTG GCCTTGGCGA TGTCAACGGC GACGAAATTG ATGACTTTGC GATCGGGGCA TGGGGCGAAT CGCCCAATAA CCGCGCCACT GCTGGCAGTG TGTTTGTGGT TTGGGGCGGC TCGCTCACCA CAACCCTCGA TTTGGCGGCG CTTGGCAATC ATGGCTATCG GATTGATGGT GCGGTTGCAG GCGATCGCTT GGGCTATGCC TTGGCTGGGG TTGAAGATCT CAATAATGAC GGTTTGAACG AAATCGTGAT CGGGGCAATT GGGGTCGATG CCAGCGCCGA TAATGCTGGC GCTGCCTATG TGGTTTGGGG CAAAACCACC ACCACAACCC TCGATTTGGC AACTGCTAGC AATTATGGCT ATCGGATCGA TGGCGCTGCT GCAAGTGATC GGGCTGGTAG CGCGGTTGGT AGCACCAGCG ACATGAACGG TGATGGCAAG CCTGAGATTT TGGTAGGCTC GTATGTGGCC GATCCATTTG GGCGTAGCGC TGCTGGTAGC GTGGCCGTCG TATGGGGTGC TAACACCACC GCCTCATATC CTATCGGAGC GCTGGCCCAG AATGGCTTTG TGATTGCCGG GGCTGGAGCG AGTGATCGCG CTGGCATCAG CTTGGTTGGC ACCGGCGATC TGAATGGCGA TCAACGAGGC GATTTGGCGG TTGGCAGCGA TCAATTTCCT GCTGGCGGCG CTGGTCGAGT TGATTTGATT TATGGATCAG CTTTCAGTGG CACGCTTGAT TTGGCGCAGC CGTTGAGCAA TACCGTGCGC TTCGTGGGAG AGCAAGCTGG TGATGAGGCT GGTTTTTCGG TGGGCTATAG CGATCAACGC CTAATTATCG GAGCCTATGG AGTTGATAGT AGCCTTGGCA CTGATACAGG TCGGGTGTAT GTGGTCAATA CCCCATCGAT CAGTAATACA ATTAATTTGG CCAATTTGAC GATAGAACAG GGCTTTAGCC TTGATGGGGT TGTTGGCGGT GGCCGACTTG GGCGGGCAGT AGCCGGTTTG GGCGACGCTA CTGGCGATGG ACATGGCGAT TTGCTTCTGG GAGCCGACCT TGCTGGCAGC CAGATCGAAG GCTATGCCTA TATCGTTGGC CGCCTGCCAA CAACAGTTTA TCTGCCGATG ATCATGCGGT AA
|
Protein sequence | MKLRQLLGLC GCGIALGLSG WAFAESPQFS ESLAETTIVA FTGGAGEHAG YAVADAGDIN QDGLADSLVG AWVADPLGRN NAGTSYLIWG QALTATLQAP DLAERGVAIY GASEGASSGW SVTGLGDVNG DEIDDFAIGA WGESPNNRAT AGSVFVVWGG SLTTTLDLAA LGNHGYRIDG AVAGDRLGYA LAGVEDLNND GLNEIVIGAI GVDASADNAG AAYVVWGKTT TTTLDLATAS NYGYRIDGAA ASDRAGSAVG STSDMNGDGK PEILVGSYVA DPFGRSAAGS VAVVWGANTT ASYPIGALAQ NGFVIAGAGA SDRAGISLVG TGDLNGDQRG DLAVGSDQFP AGGAGRVDLI YGSAFSGTLD LAQPLSNTVR FVGEQAGDEA GFSVGYSDQR LIIGAYGVDS SLGTDTGRVY VVNTPSISNT INLANLTIEQ GFSLDGVVGG GRLGRAVAGL GDATGDGHGD LLLGADLAGS QIEGYAYIVG RLPTTVYLPM IMR
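
The gene sequence begins with TTG while the protein sequence begins with M, which is what translation table 11 (listed in the Gene Information fields) would produce when TTG is used as an initiation codon. The following is a minimal sketch of that translation, not the pipeline that generated this record. It assumes the amino-acid string and the start-codon set below match NCBI's listing for table 11; `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Assumption: amino acids in NCBI's TCAG codon order for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in first position is read as methionine.
        if index == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGAAATTGC GTCAATTACT TGGGTTATGC"))  # MKLRQLLGLC
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function would allow the complete translation to be compared against the 503 aa protein.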