Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2251 |
Symbol | |
ID | 5734138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2869224 |
End bp | 2871887 |
Gene Length | 2664 bp |
Protein Length | 887 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279392 |
Product | hypothetical protein |
Protein accession | YP_001545019 |
Protein GI | 159898772 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00410803 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTGA AGCCTGTTCA GCTTCTTATT ATGACGTTCC TGCTTTTGAC CATGGCTCAG ACTGGTGCGA GATTTCGTGC CTACGCGCGG TTTGACGCTC CAGTTCAATC AATCCTTGCT GGTGAATCGC GTGGTAGCAC CCAAATTGGG GCAGCTGGCA TCCAACGCAC CAGCGCTGCA ATTATGGCCA GCCAAGCCTT AGCCGATCAA CAACCGCAGC CAGTACGTAT AGCAAAACCA CGCTTTCAGC TTGATCGCCA GCGTTTAGCT GAAAATCCCA ATGCTCCAGC GGTTACCCAA TGGCCGACTC AAGCAGATAC TTCCCCAGTT GGAGCATCGA ATGCCACGGC CTTGAGCAGT TTGAGCACTA CTTTTACCGG TGCAACCTTG GCTGATACTA ATCGAATTCC GCCGGGCACC ATGGGCACGG TTGGGCCAGG CCAATTTGTG GTGGCAATTA ATGGCCGCTT GCGAACCTTC AATAAAGCGA CGGGTGTTGC TGATGGCGTG ATCGATAGTA CGCTGGAAAC GTTTTTTAGT TCGGTGATGA CCCCACCGAT TGCCAATAAC ATCACCAACG ATCCACGGAT TCGCTATGAT CGAATGACCC AGCGCTGGTT TCTGACGGTC ACCGATCTGC CAGGCCTTTC GGGCAATCAA GTCAATCGGC TGTTGCTGGC GGTTAGCGAT GCTGCCAGTG CGGGAGTTAT TACGCCTAGC ACGGTCTGGA CATTCTACTT TTTTCAAGGC AGCAACACCG ATGTGATGGA TTATGCGAGC TTAGGCGTTG ATGTTAATGC CTTGTATATT GGCGCTAACA TGTTCACCAC CGCTGGTGCC TTTGTTGGCA CCAATGGTTA TGTGGTGCAA AAAAGCTCGA TCCTTGGGGC TGGGCCAATG GTGGTAACCA CCTTTGCTGG GTTAGTGGCA GGTGGGACTG GCGCTGGCCC ATTTGCCCCA CAAGGGGTTG ATAATTTTGA CCCAACTGCA ACCGCAGGCT ACTTTGTGGG GGTTGATAAT GCCACATTTA GCACAATCAT GTTTCGGAGG GTGAGCAATC CAGGCAGTAT GACCCCCACG ATTTCGGCCA ATATCGCGGT GACCGTACCC ACCACGACCT TTCCTACCCG CGTTCCCCAC CTAGGCAATA CTGGGGGGGC GAACGGTCAA CTCGATGGCA TTGATGATCG CTTGTATTCA GCCATGATTC GTAATGGTCG GCTGTGGACG GCGCACAGCT TCAGAACCAA TGCCGCTGGG GTTGCAAGCA CCGCAACCGG GGCACGTAAC TCTGTGCGTT GGTACGAGTT TCAGAATCTT GATACGACAC CAACCTTGCG CCAAGCGGGA ACGGTGTTTG ATAATGCTGC TGCCAACCCG CTCTTTCAAT GGATTCCAAG CGTTGCTGTT TCAGGCCAAG GCCACGCTGT GATGGGCTTC AGTAGTGCCG GAGCCACAGC CCGCGCCAAT GCATCCATGA CCAGCCGCTT GGCTGGCGAT ACTCTTGGAA CAATGCAAGC GCCGACGCTC TATACGGCGA GCAGCTTTGA TTACAATCCC GCCGCTGATC CTGGTGGGGC GGCTGGGCGA CTTTGGGGCA CTACCTCCTA TACGAGCCTC GACCCCAGCG ATGATATGAC CATGTGGACG ATTCAGCAGT TTACCAATGC CACCGATTCG TATGGGGTGC AGGTTGTCAA AGTACTAGCT CCGCCGCCAG CAACCCCTAC TACCAGTAAT CCAGCCTCGG TCGATCAGGG AACGACGACG GATATCATCA TTACTGGTAC TTCGAGCGCT GGCTCGGGCT TTTATGATCC TGGGGCAGGG TTTAGCAACC GAATTACTGC CAGCATCAAT GGTGGCGGGG TAACAGTAAA TAGTGTTACC TACAACAGCC CAACCCAAAT TACCCTCAAC ATCACGGTTG CACCTGGAGC TAGTGCTGGT GCACGCATTG TGACGGTAAC TAACCCAGAT GGTCAGAGCC TTGACAGCAC CAGCGGGATT GTAACGATCG TTGCAGCCGC GACGGCCACT CCAACCAACA CACCGACGAA TACACCAACC AACACGCCAA CTAATACCGC GACGAATACA CCAACCAACA CGCCGACCAA TACACCAACC AATACCGCGA CAGCAACGGT TAGCAACACA CCAACCAATA CGCCAACCAA TACCGCGACA GCAACGGTTA GTAACACACC AACGAATACG CCGACTGGTA CACCAACCAA CACGCCTGTG CCCACAACAT TTATTCGATA TTTGCCCTTT GTAACCATGA GCCGACTTGG CTCAATTGCG ACCCTCGGAT CAGCTGCGAT ACCGACTAAC CCAATCGCAA CCCCAGGCCT CGTCTTTTTT ACCGGTACAA TCAGTCTACC GACAGTGCTA CCAAGTGGTG GAACGTATTG GCTTTCATCA AGCCCGAGCA GCCTTGTAGC AGGTTTAGTT GATGATGCGG TGCTTATACG GGCGGGGCCA ACCGAGCTAT TTCGTTACGA ATATGGAAGC AATGGACAGC CGCAGGCCGC GTTGGTCGAA GTGCCAGCAA ACATCTTGAT TCCAAGGGCT GGACAGACGC TAACGGTGGA GTTTGTGGAT CTCTATGGGA GTGTTTATAG CGCGACCCCC TTATACCTCG TTTGGACACC CTAA
|
Protein sequence | MRLKPVQLLI MTFLLLTMAQ TGARFRAYAR FDAPVQSILA GESRGSTQIG AAGIQRTSAA IMASQALADQ QPQPVRIAKP RFQLDRQRLA ENPNAPAVTQ WPTQADTSPV GASNATALSS LSTTFTGATL ADTNRIPPGT MGTVGPGQFV VAINGRLRTF NKATGVADGV IDSTLETFFS SVMTPPIANN ITNDPRIRYD RMTQRWFLTV TDLPGLSGNQ VNRLLLAVSD AASAGVITPS TVWTFYFFQG SNTDVMDYAS LGVDVNALYI GANMFTTAGA FVGTNGYVVQ KSSILGAGPM VVTTFAGLVA GGTGAGPFAP QGVDNFDPTA TAGYFVGVDN ATFSTIMFRR VSNPGSMTPT ISANIAVTVP TTTFPTRVPH LGNTGGANGQ LDGIDDRLYS AMIRNGRLWT AHSFRTNAAG VASTATGARN SVRWYEFQNL DTTPTLRQAG TVFDNAAANP LFQWIPSVAV SGQGHAVMGF SSAGATARAN ASMTSRLAGD TLGTMQAPTL YTASSFDYNP AADPGGAAGR LWGTTSYTSL DPSDDMTMWT IQQFTNATDS YGVQVVKVLA PPPATPTTSN PASVDQGTTT DIIITGTSSA GSGFYDPGAG FSNRITASIN GGGVTVNSVT YNSPTQITLN ITVAPGASAG ARIVTVTNPD GQSLDSTSGI VTIVAAATAT PTNTPTNTPT NTPTNTATNT PTNTPTNTPT NTATATVSNT PTNTPTNTAT ATVSNTPTNT PTGTPTNTPV PTTFIRYLPF VTMSRLGSIA TLGSAAIPTN PIATPGLVFF TGTISLPTVL PSGGTYWLSS SPSSLVAGLV DDAVLIRAGP TELFRYEYGS NGQPQAALVE VPANILIPRA GQTLTVEFVD LYGSVYSATP LYLVWTP
|
| |