Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5123 |
Symbol | |
ID | 5737081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 165724 |
End bp | 168573 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282288 |
Product | hypothetical protein |
Protein accession | YP_001547879 |
Protein GI | 159901633 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATA GCATTAATGG TTCTGTCGAT CTCGATAAAT CGCAAGTAAC TGGTGTGATC GTTGGAGTTA ATCTTGGCAC GATTATCTAT GGCCGTCCAC CAGAGGAAGC CGAGCGCCAG CGCTTAGTCG CCTACTTGGA TCAGGTGACC AAAAGCCATA ATACGCTGCG GGTAGTCGGG GTTGGCTCGT CCCATCTCGC GTCAGGCATT GACCTCGCAT CCGCCTATAT GATGCTGGCG GTGCAGGGGC GGCAGCGGGT GGTGCGGACA CTGACGGCGG AAGAAATTGC GGCCCATCGC CAGCAGCGCT TTGAAATTCC TGAGGAACTG AGCGCTGATC GCTGTTTGCC CGATCACGCC GTGCTTGCGG TTGCCAAGCG AGATGGTTAC TTGGCGTTGC TCCGGGCGGA ACTGGCGACG GAAACCGTGT TGGCGCATCC CTACCTCGTA TTGTGTGGCG CACCGGGGAG CGGCAAATCA ACCTTCGCTA AGCATCTGGT GTGGGCCTTG GCGCAGCGTG GCCTTGACCA GATTAATCAT CACACGGGCT TGCTTGGCTG GGCTGACAAA CAGCGCGTGT TGCCCGTGTT TATGCCTTTA CGCACGTTGG CGGGTGCGTT GGTGGGCAAG GATTTAGGGT TGAACAACAC CCCCCATATT GGGCTGTTGC TTGATGCGGT GTGTGCCCAT CTGCAAACGA CCTATGGGCT TGAGCAGCCG CGTGAGCTTT TAAGTGCTGG ACTGGATCGC TCGCGCACGG TCTTGTTGGT GTTTGATGGC TTGGATGAAG TGCCACTGGA AGCCACTGAC CACAGCCTTG ATCGCCGCTC GCTCTTGACC TATGTCCGCT TGTTTGCCAA TGCCTATGCT GCTCGTATCC TCATCACCTG CCGCTCGCGG GCCTGGACGG AGGAGTATGG ACAGATCACG CAGTGGCCAA TGGTTGAACT GGCTCCGTTG AGCGGTGGCC AAATGACCCA GTTTATTCGC ACATGGTTTC CATTGTTGCA TGCCAAGGGT CTGATTGATC ATGAGGCCAT TGAGCGCTAT AGTGATCAGT TGACGCAGGC GTTGCGCGAT CCCCAGCGCC GCCGCTTACG GGACATGGCC GACAATCCGT TGCTGTTGAG TATGATGATT TTTGTGTTGG CTCGCAAGGG TGTGTTGCCG CGTGACCGCC ATAGCCTGTA TGACGATATC CTGAAACAAC TCTTGGGCGA GTGGGATACC ACCAGTCGCA ATGGGCAGAA CTTGGGGCAA GCGGTTGGGG ATGATCGGAT CATGGGCGAC GAGGTGCGCG ATCAGGTATT GGATCGGTTG TGTTATCAGG CGCATTTAAC CGCCACGTCA ACGGATGGGC GTGGACGAAT TCCGAGCCGT GAGCTTCAAT TTGCTTTGAT GGAGTATTTC GCCCGCGTCA ACGTGGCCGA CCCCTATCGG GCGGCGGAGC GCTGTATCGC CTATATTGAT CAATGCAGCG GCTTGCTTCA GCCGGAGGAT GATGGGAAGG TCTATGCCTT TGCCCACTTG ACGTTGCAGG AACAGAGTGC TGGTCGCCAC TTGGTGTTTT ATGAATCACT CGATCAGTTG TTGGCCTTAC GCCGTGATGA CCGTTGGCGC GAACCGATCT TTTTAGGCGT TGGCTGCCTG ACGAAGGTGG GGCTTGGAAG TGCCAAAATT GACCAACTCC TGACGACCTT GGTTGACTCC GATGCCTATG AAGCGGGAAC CATGCATCAA TACGACTGGT ATCGCGATCT GATCTTAGCT GCTGAGTTGG GGGCGGACTG CGACTGGGGC TTGTTGCGCG GCAAGCAGAT CAAGGTTGAT CGCATCCAGC GACGGTTGCG GGCGGGGCTG GTTAACCTGC TTGAAGACCA CGACCATTCC CACGCAGCAC TTGCGTATCA CCACGGCCAA GCGATGGAGC CAGCGCCGTT ATTGGTGCGC GAACGGCAAA AAGCTGCCGA ACTTCTCGCA GGCTTGGGCG ACCCACGCTA TCCGGTAACC ATAGCGCAAT GGCAACAGGA GACGCGCGAT CTGTCCACCC AGTTTGGCCG CGAGGGCAAC CATTATTGGC GCTACATCCC TGCGGGCCGT TATCAGGTTG GCGGGTGGGA TGCAGACGAA CAATCCACAG TGGTTGAACT TCAGGATTAC TGGGTCGGGC GGTTTATGGT GACGGTGGAA CAATATCGGG CGTTTATGGA GGCTGGTGGC TATACGAATA AGGATTATTG GACGGAACAT GGATTACAGT GGAAGCAACG CGAACAACGA ACAGAACCAC GCTGGTGGTA TGACCAAACC GAGCAAGAAT ACCGCAATCG ACCATTCTAT GGAGTGAGTT GGTATGATGC GGTGGCCTAT TGCCAGTGGC TGACGGATCA GCTTACGCCA TGGCTGCCGC AGGGGTATTG TATTCGGTTG GTCAGTGAGG CGGAATGGGA GGTGTCCGCT GCCTATACCG CCGACGGACA GCGCCAACCG TTCCCGTGGG GTGAGCAGCC CGCCACGCCG GAGCATACGG TGTACAATTG GAGCAGGGAA AAACGCCCCT TATCCGTTGG TTTAGGGCTG GTTGGCCAAG CGGCGTGTGG CGCACTGGAT AGCGTTGGCA ACATGTGGGA GTGGACGGCC ACGCGCGATG AGGACAACGG TGGCAATGGG CAGCAGGTGC TTGCGGATAG TGACGACCTT ATGGTGCTGC GGGGTGGCTC AGGGTACGAA AATAGTATAA ATGTTCGTTG CGCGGCGCGT CTCAGGAATC CTCCCGGCAA CGGCGTCACC ATTCTTGGAT TTCGTTGTAT TCTCGCCCAT CGTACATCTG TTCTGAATCC TGAATCCTAA
|
Protein sequence | MADSINGSVD LDKSQVTGVI VGVNLGTIIY GRPPEEAERQ RLVAYLDQVT KSHNTLRVVG VGSSHLASGI DLASAYMMLA VQGRQRVVRT LTAEEIAAHR QQRFEIPEEL SADRCLPDHA VLAVAKRDGY LALLRAELAT ETVLAHPYLV LCGAPGSGKS TFAKHLVWAL AQRGLDQINH HTGLLGWADK QRVLPVFMPL RTLAGALVGK DLGLNNTPHI GLLLDAVCAH LQTTYGLEQP RELLSAGLDR SRTVLLVFDG LDEVPLEATD HSLDRRSLLT YVRLFANAYA ARILITCRSR AWTEEYGQIT QWPMVELAPL SGGQMTQFIR TWFPLLHAKG LIDHEAIERY SDQLTQALRD PQRRRLRDMA DNPLLLSMMI FVLARKGVLP RDRHSLYDDI LKQLLGEWDT TSRNGQNLGQ AVGDDRIMGD EVRDQVLDRL CYQAHLTATS TDGRGRIPSR ELQFALMEYF ARVNVADPYR AAERCIAYID QCSGLLQPED DGKVYAFAHL TLQEQSAGRH LVFYESLDQL LALRRDDRWR EPIFLGVGCL TKVGLGSAKI DQLLTTLVDS DAYEAGTMHQ YDWYRDLILA AELGADCDWG LLRGKQIKVD RIQRRLRAGL VNLLEDHDHS HAALAYHHGQ AMEPAPLLVR ERQKAAELLA GLGDPRYPVT IAQWQQETRD LSTQFGREGN HYWRYIPAGR YQVGGWDADE QSTVVELQDY WVGRFMVTVE QYRAFMEAGG YTNKDYWTEH GLQWKQREQR TEPRWWYDQT EQEYRNRPFY GVSWYDAVAY CQWLTDQLTP WLPQGYCIRL VSEAEWEVSA AYTADGQRQP FPWGEQPATP EHTVYNWSRE KRPLSVGLGL VGQAACGALD SVGNMWEWTA TRDEDNGGNG QQVLADSDDL MVLRGGSGYE NSINVRCAAR LRNPPGNGVT ILGFRCILAH RTSVLNPES
|
| |