Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4879 |
Symbol | |
ID | 5736956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6214452 |
End bp | 6216152 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641282045 |
Product | hypothetical protein |
Protein accession | YP_001547637 |
Protein GI | 159901390 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.413638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGTTTA TCCGAAGAGT GTGTTTCGTG TTGGTCTTTT TAGGTCTGGT TGCCAGTGCG TGGCTTAAGC CGGCACCAAT TGCTGCCCAA GCCCCATTGA CAGGGGATCA AACAGCCCCC TTGTTGAACC CAGTGAGCGA GGGCACTCAA CAAGATTTCG AGGCTGGCCT CAATACCGCA AATGGTGCTG TGCTCTATCA AACGGCATCA TATCCTGGGG TAAGTTTTCA CCGACGTAAT TCAACGGTAG GCTACGCCTA CTACGGCGGT GGTTGTACCT ATCTTACGGC GCTGAGCAGT GGGAACGAGG ATAACAACGC CCTTAGCACC CGCCTCGACA TTCCCGACGG TGCGGTCATT CTAGGGGTTG AATTTCAATA TCGCGATACC GACGCAGTTA ATAATTCACG CTTATATCTC TATCGTTTTG ATGGTGCTGG TGGCGTTGCT ACAGTCGCAT TATTAAATAG TAGTGGTAAT GGTGGCTATG GCTCAAGCTA TACTGCAACC AATCTCAATA CCCTCTACGA TGCCTTTAGC TATTCCTACT CGTTGGTTTG GTACAGCGGC AGCGTTGGCA TCAACCATGC ACTTTGTGGT GCACGGGTAA AATATGCTTA TAATCCGCCG GTTGCCCGAC CATTCACGAT GCCTCCTGCC AACCAACCAG ATGCCTTACA AGGTGGTAGC AGCGGCTATA GTTTTACTGC CGCTAGCGAT TTTGTAGCCT TTGAAGAGTC TGCTGCCTAT AGTTATTCTA GTGCCGGTTG TATGATTCAT ACTGGTGGAG CCAACCTTGC AGCTGAGGTT GATTTGGCTG ATGGCACGCA ATTGGCTGGC TATCGGGCAT ATTATTACAA TACTGCCAGC GGCGCAGCGA TGAATGCAAA TCTGGTATGG ATCGATGGCT CATCAACAAA TACCGTTCTA ATTGCATCTA CCACAGCTAG TAGCGGTTTT GCGAATGAGT ATTTTGTGCC ACCTTCGGCC CAGATTATTG ATGAGTTCAG TAAAGGCTAT CTGATGTATA TTCGGCCTGG AACGTCAACG ACGACCAGAA TGTGTGGTAT TCGTACATTC TATACGACTC CGGTCCAAGC CAAGCAATTG CCAGTTTATA TCAACCCGGT TAGCTCAGTC ATTGCAGATC ACGATGCCAG TGGCGTGCTA TTAACCGAAC AGCAGCCTGA ATCAGCCCCG ATCCAGAGCC TAGATCTAGG GACGGTTGAA GAACTAGCAA GCCCGCTGAT TACCAATGAA TACCAATTTG TCACAGCTCG TTCATTTATG CCACGCGATA ATGTCTATCT TGTGAACAGC GCCCCAGCTG GGTGTATTTC GTTTGGCTCT GATGCTGAGG TAGACTATAC CTTCCAATTG CCACCAAACA GTGGGCTACG AGGAATTCGC TTCTACTATC GTAATATTGC TGGCAATTTG GGCAATGCGC GTTATTGGGC CTTCGATGGA CGTGGCTGGT ATAATCCAAT TTTCACCTAT GCGATTCCAA CCTCAACGAA CTATGCTTCG CAGTTGGTAA GCTATACTGG TTCGCACTAC GATCTTTCCA ATGGCAGTGT GGCCCATTCA CTGAATTTCT CGATCCTTAG CCCGAGCAAT ACGGTTGAGT TTTGTGGTGC TCGGATTTGG TACACCACCG GCCTGCGTTC GCTCTACATT CCAGTTGCTT TCAAAAATTA A
|
Protein sequence | MWFIRRVCFV LVFLGLVASA WLKPAPIAAQ APLTGDQTAP LLNPVSEGTQ QDFEAGLNTA NGAVLYQTAS YPGVSFHRRN STVGYAYYGG GCTYLTALSS GNEDNNALST RLDIPDGAVI LGVEFQYRDT DAVNNSRLYL YRFDGAGGVA TVALLNSSGN GGYGSSYTAT NLNTLYDAFS YSYSLVWYSG SVGINHALCG ARVKYAYNPP VARPFTMPPA NQPDALQGGS SGYSFTAASD FVAFEESAAY SYSSAGCMIH TGGANLAAEV DLADGTQLAG YRAYYYNTAS GAAMNANLVW IDGSSTNTVL IASTTASSGF ANEYFVPPSA QIIDEFSKGY LMYIRPGTST TTRMCGIRTF YTTPVQAKQL PVYINPVSSV IADHDASGVL LTEQQPESAP IQSLDLGTVE ELASPLITNE YQFVTARSFM PRDNVYLVNS APAGCISFGS DAEVDYTFQL PPNSGLRGIR FYYRNIAGNL GNARYWAFDG RGWYNPIFTY AIPTSTNYAS QLVSYTGSHY DLSNGSVAHS LNFSILSPSN TVEFCGARIW YTTGLRSLYI PVAFKN
|
| |