Gene Information |
Locus tag | Haur_2330 |
Symbol | |
ID | 5734202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2973897 |
End bp | 2976494 |
Gene Length | 2598 bp |
Protein Length | 865 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279471 |
Product | hypothetical protein |
Protein accession | YP_001545098 |
Protein GI | 159898851 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
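The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2973897, 2976494)
print(length)                  # 2598
print(protein_length(length))  # 865
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.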
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCCAA GCTTTGTGCA ACAACGATTC AGGTCAGTAG TTAGTGCTCT TATCATTTTA ACGTTTGGTC TTGGTTCATT TAGTTGGCTG TTACAATCGG CGTTTGCCAA TACGATTGAT GCAACACCAT ATTTAAATCC ATTAGCGCCC GATTTTGGCA TTCAAGCGGC GATTGATGCT GCGGCTGCTC AGGGGGGCGG CACGGTGCGC TTACCCGCTG GTAGCTTTAC CCTCGAAACC TACCTCGATC TTAAAACTGG CGTGACTTTG CAAGGGGTCG GGGCTGAGAC GATTTTAAAG GCTGGTCGTA ACGAGCAACG GGTTTTTGTC ACGCAAACTG GCAGCAATCT TTCAACAATT AAAGTTGCCA GCGTTACGCC ATTTCGAGTT GGTATGATCG TCTATGTCTG GCGTTCGACA GAATTGCGCT TCTTGCCTGG CTCCTATGAA ATTATGAGCA TCAATAGTAC TAATCAAACG ATCACGCTTG ATCGCGCGGT CAATTACCCG CTGACTGCTA ATGTTTCGCA AGTGTCGTAT GGTTTGTACA CCAAATTGAC CGGTGCTGCC ACCCAAGGAA CCAATGTGAT CAGCGTGGCC GATACCAGCG TGTTCAATCC AGGCGAAGGC ATCATTATTA AAGGAACTGA AGGTACAGGC ATTGGCAATT GGGGTGTTGA GCAAAATATG GTCGATTCGA TCAATACCAG CAACAACACG CTGACCCTCA AAAAGCCTTT GACGCTTTCA GTGCCCAATA ACTCAGTCGT ATCGCACGCC TATTCGGCGA TTTTTGCGCT TGGAACCAAT TTCAACAATC GCTTGCAAAA TATGGGTGTA CGCGATTTGA CGATTGAGGG CTGGAACACC AACCAAAAGC CGGCCTTCTA TGAGTTTTAT ATTGGAGCGA TTAACTTTGT CTATTGCCGC TTTGTGACGA TCGATAACAT CACCGTGCGC TATTGGCATA GCGATGGCGT AAGCTTGCAA TCGTGTGATC AAAGCACGGT CAGCAATAGT CTGGCAACTG CTAATCGTGG CCATGGCTTC CATCCAGGTA CTGCTTCACG CGATATTGAA TTTTTCAAAA TTCAAGGGAT TGGCAATTTG GGCTATGCAG CACGTGGGAC TGCTGGCGAT GGTTTGTACT ATTGTTGGGC TAACCAACGA GTTAATATTC GCCAGAGCGT TTTCCGTAAT AATGCTGGCT CAGGCGTTGG CGATTTAGGT GGTGGCGATA CCGATAATTC CTCACGTGAT ACCGATAACA TCATTGAAGA TAGCATTATG GAAGGCAATT TCCGCGCTGG GATTGAGGTT AATGGTGGTG GCAATACGGC CAATAACATT ATTCGCCGCA ACGTGATTCG CAATAACAAC ACTGGCAATC AAGATTATGC TGGCATTAAC TTGCTTTCCA AGCGCGGCCC AGTCCAACGC TATATCATCC AAGATAATAT TGTTGAAAAC ACAGCGGGCA GTAACCAACT CTTTGGGATT CGTGAGGTCA ACTTGGCTGT GCCACCCACC ACCCCAGTTG ATTATCTGAC CGATTTCAAC ACGATCACCA ATAACACGAT TTATAACCAC CCAAGCAATA ATTTGGTAGT GATTGGCCCC AACACCGTCG CCACTGGCAA TATTTTCACT GCACCAGGCG CGGTGATCAC ACCAACGCCA ACCAATATTC AACCAACTGC GACTGCTACA ATCGCGCCAA CCAACACTCC AACCCCAACT GGTTCGTATG TGCCACGCTT GATTATGTAC AATGCCGATA CTGATCAAGT TATGTATGAT 
CCAATTCCCA ATGGCGTGAC AATTAATTAT GCGACGCTGG GAACCCGTAA TATCAGCATT GTTGCTCCAA CCGCGCCTTC GAGCGGGATT GGCAGCGTGC GTTTTTGGGT TGATAGCGTG GTTTATCGCA CCGAAAGTGG TCGGCCTTAT TCAATCGCTG GCGATCAAAC CAATGGTACA GATTTCTTGC CGATGAATCC GGCCTTGGCC CATGGAACCC ATGTGATCAA AGCAGCCACC TACACAGGTT CAGGTGGAAC TGGCACACAA GGCACACCCT ATCAAATTGT GATTAATATT GTTGATAGTA ATGCCACGGC TACGCCAATT CCAACCAACA CCAATACCCC TGTACCAACT GTACCAACCG CGACCGCAAC TGCAACGAAC ACGCCAACCA ACACGCCGAC CAATACGGCA ACCAACACGC CAACGGCAAC AGCAATTGCA ACCGCGACTA ATACGCCAAC TGAGATTGCA ACGCCCACGG CCACGGTAAC GGAAGTGGCG ACGGCCACAC CAACCGAAAT CGCCACGATC ACCGCAACCG CGACTGACGT TGCCACGGTC ACTGCGACAG AAATTGCTAC AGCTACGGCG ACAGCGACGA TTACGGCAAC ATTAACGAAC ACACCAACCA ATACACCGAC TAATACTGCA ACTGCAACGC TGACTGAGAC ACCTACGGCG ACGCTTGAGC CAAGTGTTAC ACCAAGCAAT ACACCAACGG CGACAACCAC GGTTACGACT CCAGTGCCGT CAACCCATCA TGTTTATGCG CCATGGGTTA CCAACTAA
|
Protein sequence | MQPSFVQQRF RSVVSALIIL TFGLGSFSWL LQSAFANTID ATPYLNPLAP DFGIQAAIDA AAAQGGGTVR LPAGSFTLET YLDLKTGVTL QGVGAETILK AGRNEQRVFV TQTGSNLSTI KVASVTPFRV GMIVYVWRST ELRFLPGSYE IMSINSTNQT ITLDRAVNYP LTANVSQVSY GLYTKLTGAA TQGTNVISVA DTSVFNPGEG IIIKGTEGTG IGNWGVEQNM VDSINTSNNT LTLKKPLTLS VPNNSVVSHA YSAIFALGTN FNNRLQNMGV RDLTIEGWNT NQKPAFYEFY IGAINFVYCR FVTIDNITVR YWHSDGVSLQ SCDQSTVSNS LATANRGHGF HPGTASRDIE FFKIQGIGNL GYAARGTAGD GLYYCWANQR VNIRQSVFRN NAGSGVGDLG GGDTDNSSRD TDNIIEDSIM EGNFRAGIEV NGGGNTANNI IRRNVIRNNN TGNQDYAGIN LLSKRGPVQR YIIQDNIVEN TAGSNQLFGI REVNLAVPPT TPVDYLTDFN TITNNTIYNH PSNNLVVIGP NTVATGNIFT APGAVITPTP TNIQPTATAT IAPTNTPTPT GSYVPRLIMY NADTDQVMYD PIPNGVTINY ATLGTRNISI VAPTAPSSGI GSVRFWVDSV VYRTESGRPY SIAGDQTNGT DFLPMNPALA HGTHVIKAAT YTGSGGTGTQ GTPYQIVINI VDSNATATPI PTNTNTPVPT VPTATATATN TPTNTPTNTA TNTPTATAIA TATNTPTEIA TPTATVTEVA TATPTEIATI TATATDVATV TATEIATATA TATITATLTN TPTNTPTNTA TATLTETPTA TLEPSVTPSN TPTATTTVTT PVPSTHHVYA PWVTN
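The relationship between the two sequence fields can be illustrated with a short translation sketch. It assumes the gene sequence is given in coding orientation (as shown, it begins with ATG and ends with TAA) and uses the standard codon assignments, which translation table 11 shares for internal codons. Alternative start codons, which table 11 also permits, are not handled here. The names `translate`, `gc_percent`, and `clean` are illustrative and not part of any database tooling.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three and last two blocks of the gene sequence in this record.
print(translate("ATGCAGCCAA GCTTTGTGCA ACAACGATTC"))  # MQPSFVQQRF
print(translate("CCATGGGTTA CCAACTAA"))               # PWVTN
```

The two printed fragments match the first and last residues of the protein sequence above. Applying `gc_percent` to the full gene sequence would be one way to check the GC content field.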
|
| |