Gene Information |
Locus tag | Haur_3196 |
Symbol | |
ID | 5736898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4041350 |
End bp | 4043323 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280342 |
Product | hypothetical protein |
Protein accession | YP_001545961 |
Protein GI | 159899714 |
COG category | |
COG ID | |
TIGRFAM ID | |
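The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 4041350
end_bp = 4043323
gene_length_bp = 1974
protein_length_aa = 657


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_protein_length(computed_length)

print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, both comparisons agree with the Gene Length and Protein Length values listed in the table.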
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00339167 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTACTAT GGCACAGTCT GGACAACAGG CTGTGTCATT GTTTTAGCTC GCTTTTTGGA AGCCACATGC AGCAACATTC TCGATTAAAG CTTGGTTTGG CGTGGCTGGG CTTAATTGGG CTTTATAGCC TGACGCTCAC CCAAATTCAT ACCTTCGATG CCTACTCCTA CGCGACGGCA GTGCAAGCCA AGCCATGGCG CGAATCGTTT CATCCTCATC ATTTATTCTA TGGGCCGCTT GGTGAGATTA TCTATTGGCT CAGTCAAGGT TTGGGTTATC AAGGTATGGC ATTTGGGCCA TTGCAAATGC TGAATGTAGT GGCTGGAGCC AGCGGCGTGA TCATTTGGTG GCGTTTGCTT CAGCGTTTGA CCAACCAGCC TTGGCTGGCA ACCAGCGGCA GTGTGTTGGT TGGCGGAGCC TACGCTTGGT GGTATTACGC AGTTGAGGTC GAGGTTTATA CCTTGGCGAG CTTGTTTTTA ATTATTGCCA CTGGCTTGTT GATTCGGCTG GCTGAAACGC CCCAAATACT GAGCAATTGG CGCTGGCTTG GGCTAGCCCA TAGTGCGGCA ATCCTGTTTC ATCAAACGAA TGTGCTTTGG CTTGTGCCTG TGCTGGTGGT TTGGCTGAGC GCTGCGTGGG GCACTACCGC ATGGCAACAA CGCTGGCACG CATTTTTGCA GTATGCCATG GTTGGGTTGC TGGTGGTTGG TGGTAGCTAT GCCATCGTGA TGTTTGGCTT GAGTGGCTTC CGCACATGGC CGCAAGTGCA ACAGTGGCTG TTTGAATATG CCAATACTGG TTTGTGGGGC ACGACCAATG CCAACACCTT TGCCAATTTG CTCAACGGCT GGCAACATAC GATCCATGGT TGGTTAGGTG GTGCGGTTTT GCTGGCCAGC CTTGGCCTGA TCGCTTGGCG CTGGCGTTTG ATGTGGCGGC AATCACGAAT GTTAGTGGTG TTGGCCGCGA GCTGGCTCAT CACCTATAGC CTCTTTTTTG GCTGGTGGGA GCCAGATAAT ATTGAATTTT GGATTGCTTG CTTGCCGCCG TGGGCTTTAT TGATCACGCT TAGTTTGCAT ACGCTCAAGC TGCCGCAGCG CCGTTGGTGG CAGCCTGCCC TAAGTTTAGC CTTGGTTGGC ATGAGCCTTA GCAATGGCTG GCAAATTTAT GCCAATGGCG CTGCGGCCAA CGATCAAGAT CGCCAAATCA TCACTGAATT GGCCAAAACT GGCAGGCCCA ACGATTTTTA TTTTGTGCCG AATGGTTTGC AAGCCTTGTA TGCCGAATAT GAATTTGAGC GCCCAAACAG CCTGCCACTT AGCGTTAGCC CTGGCGATTG GCAGCATGGT TGCTTGGAAA TTAGCGCTAG AATCGTCGAT ACTACCAGCG CTGGCTATAC GGTTTGGCTT GATCAGCAGG CAGTCGAGCC TAGCCCAATT TTGCTCGAAC GTTACGGACT AGAGCAATCA GCGGTTACCG CATGCTTTGC CCCATTTTTA GCCCAAGCCC AGCCAATTAC CCTGACCCAT GCACGTTATC TCAAGCTCGA CCCGATTCCT CAAGGCTTGC CCGATTGGCA CTGGCAGAAT TGGAGTTTGG GCTGGCGCGA GAATTTTATT ACTGCTAGCA CTTGGGGCGC TGGCTGGACA TTTATCCCAC AGCAAGACCC ACATTTGCTT AGCCCACGGA TTAATCTGGC GAGCAGCCAA TGGCGCAAAC TTGAAATTAC TATGGCCAGC ACCCTGCCAA ATCAACATGG CCAACTGTAT TGGATGGCTC CTGGCGAGGG TGCAACCGAA 
GATCATTCGG TCTCATGGGA TGTGATTGGT GATGGAGCGC TGCATACCTA CACCCTTGAT CTTAGCCAAA TCCCAACTTG GCAAGGCTCA ATTGGCATGC TGCGGCTTGA TCCGGTTGTG GCGGGTGCTG AGCAGCAAAC CGTCACGATT CAGCGCCTAC GCCTGCTGCC CTAA
Protein sequence | MVLWHSLDNR LCHCFSSLFG SHMQQHSRLK LGLAWLGLIG LYSLTLTQIH TFDAYSYATA VQAKPWRESF HPHHLFYGPL GEIIYWLSQG LGYQGMAFGP LQMLNVVAGA SGVIIWWRLL QRLTNQPWLA TSGSVLVGGA YAWWYYAVEV EVYTLASLFL IIATGLLIRL AETPQILSNW RWLGLAHSAA ILFHQTNVLW LVPVLVVWLS AAWGTTAWQQ RWHAFLQYAM VGLLVVGGSY AIVMFGLSGF RTWPQVQQWL FEYANTGLWG TTNANTFANL LNGWQHTIHG WLGGAVLLAS LGLIAWRWRL MWRQSRMLVV LAASWLITYS LFFGWWEPDN IEFWIACLPP WALLITLSLH TLKLPQRRWW QPALSLALVG MSLSNGWQIY ANGAAANDQD RQIITELAKT GRPNDFYFVP NGLQALYAEY EFERPNSLPL SVSPGDWQHG CLEISARIVD TTSAGYTVWL DQQAVEPSPI LLERYGLEQS AVTACFAPFL AQAQPITLTH ARYLKLDPIP QGLPDWHWQN WSLGWRENFI TASTWGAGWT FIPQQDPHLL SPRINLASSQ WRKLEITMAS TLPNQHGQLY WMAPGEGATE DHSVSWDVIG DGALHTYTLD LSQIPTWQGS IGMLRLDPVV AGAEQQTVTI QRLRLLP