Gene Information |
Locus tag | Haur_1192 |
Symbol | |
ID | 5733085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1371697 |
End bp | 1373736 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278332 |
Product | hypothetical protein |
Protein accession | YP_001543968 |
Protein GI | 159897721 |
COG category | [S] Function unknown |
COG ID | [COG1306] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
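The length fields in the record above are consistent with its coordinates. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 1371697
end_bp = 1373736

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is assumed
# to be a stop codon, which contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values match the listed Gene Length (2040 bp) and Protein Length (679 aa).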
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000450505 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTTTG TGAGTCAAAC CAAGCGTCGC CTAACAGCTG GTCTGTTTGG CGCTGTTGCG TTGGTGCTTG CCGCCTGTGG CAGTAGCAAT CCGCCACTGA CGGGAATTGT AACAGATAGC TATACCCAAA AGCCCGTTGC TGGAGTAACC ATTCAAGTTG GCGAAGCAAG CGCTACGAGT GATGCTGATG GTAAATGGAC GATTAATGAA TGGGAAAATA CCAATTCACT CTTAGTTCAA GCCAGCGATT ATAGCTCAGC TACGCTTAGT TTGGCCGATA AAACTCCTGT CGATGAGCAA ACTCCGGTAG AAGTTAATCT GACGATTCGT CCCAACACGA TCAGCGGGGT TGTGCTTGAT CAATACTCAC AACAGCCTGT GGCTGGCGTG ACGGTCAAGG CTGGCACGAG CCAAGCCACC AGCGGCGCTG ATGGCCGCTA CAAATTGACC GATGTTGCTG AAAAGGCTGA AGTGGTGATT GTGGCAACCG ATTACACCAG TGCCACCGCT ACCCTCGAAA AACAAACAAG CTACGATGTT TCGCTGCGCC CAACCAGCTT AACTGGGATC ATCAGCGATA AATATAGCCA AAAACCAGTC AGCGGAGCCA CGGTCAGCGT TGGTAGCGCC ACGGCCCAAA GCGATGCTGA AGGTCGCTAT ACAGTTAAGA ACATCGATTT GGATGCACCA GTTGTTTTCA GCGCTACCGA TTACAGCAGC CAAACCTTAG AATTGCCGCA AGCCGCCTCG TTGGATGTGG TTTTGCGACC TTCGACCGTG CGTGGCTCAG TCGTCGATAG TACAACAGGC AAGCCGTTGA CCAAGGGCAC GGTTATCGCC ATGGTCAAGC CATTTGAAGG GGCTGATGAA ACCTATCCCT ACACTGGCAC TGCCGTTACT ATGGCACGGC TGAATGCCGA TGGCACCTAT GAATTAACCG ATGTGCCTGA AAATGCCCAA ATTCAGGTGC TCTCGCCTGG GTATCGCAAG GCTTGGACGG CGCTCAGCGA AGGCAAATTT ACCGCCGATC TAGAAGCTGA AGAATTTGTA GCCAAAGCAA TTTATATTAC CGCAGCAACT GGCTCGTCAA AAGCTTCATT AAGCGAATTG TTTGATTTGG TTGATCAAAC CGAAGTTAAT GCGGTGGTGA TCGATATCAA GCTGGATATT GCTGGCGATG TTGGCGGAGT AGGCTATCTC TCGCAACATC CATTGGTATT GGCCGCCGAA ACCTCATCCG ATTATTTGGA TATGGAATGG ATTGTGGCCG AAGCTCGCAA GCGCGATATC TACTTAATTG GCCGCATGGC GGTAATGCGC GATAATCGTT TGGCCGATGC TCACCCCGAA TGGGCCGCCC AAAGCAAGGC CACTGGCGGA GTTTGGGAAG ATGACGGTGG TCTCAAGTGG CTTGATCCAT TCAACCCCAA CGTCACCGAG TATAATGTGG GCATTGCCAA AGAAATTGCC GCATTTGGCT TTGATGAAGT ACAATTCGAT TACATTCGCT TCCCATCGGA TGGCAGCACC AGCAATTTGG TTTTCTCCAA GCCGATTGAT CCCAAAAATA ATCCGGAAGT GATGTACGAA GCAATTGGCA ATGTGCTCAA ACGCGCTCAT GGCGATATCA ATGGTTCAGG CGCATTCTTC TCAATCGACG TGTTCGGTTA TGCCACATGG CGTAATATGT GGGAAATTGG CCAAAGCCTT GAAATTATGG CCGATCACAC CGATTATGTC TGTGCAATGG TCTATCCTTC GCACTACGAT CGCAATGAGT TGGGCTTCGA TAACGCCGAT 
GCCTACCCTT ATGAGATCGT CAAGGATAGT ATCGAAAAAG GCCAAAAGCG CATGGAAGGC AAATACGCAG TGCAACGACC GTGGCTTCAA GCCTTCACCG CGACATGGCT TGATCCAGTA ACACCATATG GTCGCACCGA AGTTCGCGCC CAAATGCAAG CAGTCGCCGA AGTCGAAGGC ACGTATGGCT GGATTCTCTG GAATGCTGCC AATTATTACG ACCCCGACTG GCTCGATTAA
|
Protein sequence | MYFVSQTKRR LTAGLFGAVA LVLAACGSSN PPLTGIVTDS YTQKPVAGVT IQVGEASATS DADGKWTINE WENTNSLLVQ ASDYSSATLS LADKTPVDEQ TPVEVNLTIR PNTISGVVLD QYSQQPVAGV TVKAGTSQAT SGADGRYKLT DVAEKAEVVI VATDYTSATA TLEKQTSYDV SLRPTSLTGI ISDKYSQKPV SGATVSVGSA TAQSDAEGRY TVKNIDLDAP VVFSATDYSS QTLELPQAAS LDVVLRPSTV RGSVVDSTTG KPLTKGTVIA MVKPFEGADE TYPYTGTAVT MARLNADGTY ELTDVPENAQ IQVLSPGYRK AWTALSEGKF TADLEAEEFV AKAIYITAAT GSSKASLSEL FDLVDQTEVN AVVIDIKLDI AGDVGGVGYL SQHPLVLAAE TSSDYLDMEW IVAEARKRDI YLIGRMAVMR DNRLADAHPE WAAQSKATGG VWEDDGGLKW LDPFNPNVTE YNVGIAKEIA AFGFDEVQFD YIRFPSDGST SNLVFSKPID PKNNPEVMYE AIGNVLKRAH GDINGSGAFF SIDVFGYATW RNMWEIGQSL EIMADHTDYV CAMVYPSHYD RNELGFDNAD AYPYEIVKDS IEKGQKRMEG KYAVQRPWLQ AFTATWLDPV TPYGRTEVRA QMQAVAEVEG TYGWILWNAA NYYDPDWLD
|
| |
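The relationship between the gene sequence and the protein sequence can be sketched in plain Python. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in the set of permitted start codons), so the sketch builds the standard table. The function names `translate` and `gc_fraction` are hypothetical helpers, not part of any database API, and alternative start codons are not handled because this gene begins with ATG. Only the first and last 30 bases of the gene sequence are used here, copied from the record above.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Drop the spaces and line breaks used to group bases in the record."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last 30 bases of the gene sequence (both are in frame).
head = "ATGTATTTTG TGAGTCAAAC CAAGCGTCGC"
tail = "AATTATTACG ACCCCGACTG GCTCGATTAA"

print(translate(head))  # MYFVSQTKRR
print(translate(tail))  # NYYDPDWLD
```

The two printed fragments correspond to the first ten and last nine residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.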