Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5253 |
Symbol | |
ID | 5737211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 24538 |
End bp | 26157 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641282417 |
Product | hypothetical protein |
Protein accession | YP_001548008 |
Protein GI | 159901763 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0754087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCGA CCCACCCATC TGGTGGGGCC ACCGCCATGC GCACGCCGCA TTGGTTCGCG GCCCCCGATA CCGACTATCT CTGGATTCCT TCCTATCTGC TCGAAACCCT GTATGACTCG CCCACGGCCA TTGGCCTTTT TGCGTTGATT GCGCGGCGCT GGCTCGCCAG TACCACCGCC ATGGTCGCCT TGAGCGATCA GGACATTCAG CGCTACGACC CAACTATCTG CCGTGGCTCG ATTCGCGCCG CCATCGATCG CTTGATTGGT GGTGGCTGGG TGCAGGTCGT GCGCCAACGT GGCCGCAAAA CCCAGTATTG CCCCGCGTGG GGCAAAAGCA CCAATACCCG CGCCTGGTCG AAAACGGGCA CGCAACTCAA TCGGCGGCGT GTCCGGACGG TGCGGTGTGA TCGCAAACTC TTTGACGACT ATATGGGGCG GATTATTCCG CATGAGCGGG TTCCTGCCGT GATTGAACGC TTTGGCGTGC GTGCCCACTT GAGCTTGGCC GATGTGGGGA CGTACCTGGT GATGCAACAT ACGCCGCATG CGATCAGCGC TACGCCAGCC CTTGAACAGC TCGATTTGTG CTATAACGCG GAAGCCTTGG CGGTTCCGAC GACCGAGGAA AGCTTGGCCA AAATGGAGTT AAGCGCCTGC GGAGCGCAGC GGCTGGGGTT GCTCAACGAA CCCCGCCCAC GCCCGATGAA GCAGCCCATG CTGAGCCAGC ATCTTTTTTT TGTGCCACCG AAGTTGGCTA GGCAGTTGGC TAGGCAGTTG GCTAGCCAAC ACGCGCTGAA CCAGGCAGGA AATAGCCCAT TGCAATCGGC AAAAACGGCG GTTGTGAAAG AAAGCACGGT GGTCACAGGC ATGTTAAGCA CTTTAGGCAT TGGAGATCCC CCTACCCCCA CAATCACAAC GAAACAAGTG CAAAAGAAAA CACTCTGTGG TGGAGAGTTC TCTTTTCGAG AAAATGGAGA GCGATTAATG ACCAAAAACG ACGAAGGGGA TACGACCAAT CAACGCTCCA ATCAGCCGCG CCGCCGCCGC AACGTTATGT CAATTCCAGA AACGCCAAGC GCCAAGCGCC TGCGCGAATT GAACGTGCGG CCCCAAACCT GTGTTGAGTT GGCGGATCTG CCCGTGGAGT TGATCAACGC CTGTATTGCC GATGGGCAGT CACGGCCTGC GGTCTATGAT CTGGCAGCGT GGACGGTCTC GATGGCGCGT GATGCGCGGG ATCATGGCTG GCAGGTCGCG CTCAACAAGC GAGGAGCGCC GCCGACGAAC CAGTGGGATG ATCCTGCGGT GGCGGTTGCT AAAGCCTTGG CTAGCGGTCT GTTTAACCGC GCGGATGACC TTGAAACCTC GCTGGATGAA CTCCCATCCA CCCCACCGCA ACGCGGTGGC GATCAGCAGG AGGCCACGGC GACCGATTGC CCTGCCTGGA TCGCGCCAGC AACGTGGCAA ACGCTGTCGC CTGGACTTCA ACATCTCTTG GAGCGCTCAC GGTTGCAGGG ACGGCAGGTC GTGGCCTATG ATAGCGGGCG ACAGCGCATG CTGGCTGATT ACGAGGCGCA GATCGAGCGC TTGGTGATGG CCGCCGTTAT GCGCCGATAA
|
Protein sequence | MPSTHPSGGA TAMRTPHWFA APDTDYLWIP SYLLETLYDS PTAIGLFALI ARRWLASTTA MVALSDQDIQ RYDPTICRGS IRAAIDRLIG GGWVQVVRQR GRKTQYCPAW GKSTNTRAWS KTGTQLNRRR VRTVRCDRKL FDDYMGRIIP HERVPAVIER FGVRAHLSLA DVGTYLVMQH TPHAISATPA LEQLDLCYNA EALAVPTTEE SLAKMELSAC GAQRLGLLNE PRPRPMKQPM LSQHLFFVPP KLARQLARQL ASQHALNQAG NSPLQSAKTA VVKESTVVTG MLSTLGIGDP PTPTITTKQV QKKTLCGGEF SFRENGERLM TKNDEGDTTN QRSNQPRRRR NVMSIPETPS AKRLRELNVR PQTCVELADL PVELINACIA DGQSRPAVYD LAAWTVSMAR DARDHGWQVA LNKRGAPPTN QWDDPAVAVA KALASGLFNR ADDLETSLDE LPSTPPQRGG DQQEATATDC PAWIAPATWQ TLSPGLQHLL ERSRLQGRQV VAYDSGRQRM LADYEAQIER LVMAAVMRR
|
| |