Gene Information |
Locus tag | Haur_5234 |
Symbol | |
ID | 5737192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 1014 |
End bp | 2927 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641282398 |
Product | hypothetical protein |
Protein accession | YP_001547989 |
Protein GI | 159901744 |
COG category | |
COG ID | |
TIGRFAM ID | |
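
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1014
END_BP = 2927
GENE_LENGTH_BP = 1914
PROTEIN_LENGTH_AA = 637


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of codons excluding one terminal stop codon.

    Assumes the gene is unspliced and its length is a multiple of three.
    """
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for the values recorded in this entry.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 2927 - 1014 + 1 gives 1914 bp, and 1914 / 3 gives 638 codons, one of which is the stop codon, leaving 637 amino acids.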
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.167052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAC CACCAGCAGC CCAACCCGCG TCCAGTGGCG ACGGCGGCGG GGCAATGATG GGCTTTTTCA TGGGCCTGAT CGTGGTGATC GTGGCCTTGC CCGCAGCGAT TGGGGCATTG TGGGGCAGAC GCTATATGAA AACCCACCAG CCGATGGTGG CGATGCTTGC GTTAGCAGGC GCAGGGCTGA TGGCCTTGGT CGTGTGGTCG CCGTTGCAGG ATCATTTAAA GGAGAGCCGC ACGGCGATGG AACGGGAAAA ACGCCGCGAT GGGATCATGG GCGTAGTCAG TGCGGGAGCC GGAGCGATTC CGCTGTTGTG GCTCTACACC ATCCCGCTCG CGCCTGCGTT GACGATGGTG TGGGAGAGTG TGCGGCCCAA ATCCTTAGCC GAGCAACAAG CCGACAAGGA TGCCCAAGCC CAGGAAAAGC GCACGCTTCA GCTGGACTCG GCCAAACGAC GGGCGCTGAA AGCGAGTGCG GCGAGCCTCA CCCATCGGCC TGTGACCAAG GAGATTGATA GCGCCACCGT CTTAGGGGCC AAGATTCAGG GCGATCCGTT GTTCTTTGCC AATGAGCATA AGCGGTTATT ATATATCACA ACGAGCATTG GCGCAGCCAG TCTGCATATG CTGTTTATTG GCGAGAACGG AAGCGGGAAG ACCATTTCAA TGTTGCGTTT TGCTGCTTCG ATAGCGGCTT CCACGAATTG GGACATCTTT TTCATCAACC CCAAAAACGA TGCCAAAACG ATGCAAGAAT TTTATGATGT GATGGCGTTT TATGATAGGC AATGTCGCTT GTTCCCGCAA GAAGCCTATA ACGGCTGGGA GGGGGATAGC GGGGCGTTGC TCAGTCGGAT TATGGCGATT CCGGCCTATG CAACCGAGGG CGCGGCTTCA TTTTATGCCG ACATGAGCGA GGTCTACTTA CGAGCGGTCT TGAACACCGA TGAGGCGTTG CCAAGCTCGT TTGAAGAACT CGAAGAACGC TTGCAATATG GCCGTTTGGC CGACCAATAT AAAAACAACG CGCAGGGGTT TGCGCGGGTG GCCAGCATCA GCGCAGCGGA TGCCAAGAGC GTCTTTATGC GCTTTGCGAC CATGACCCCC AAACTCACGC AAATTCGGCG GGATGGCTGG CGATTGAGCG AGGCCCGCGC CGCCTATTTT GGCCTGCCCG TGTTGGCCAA CGAGCGCGAT AGCCAAAGTA TTGCCAAGTT TCTCTTGGAG GACATCAAAC ACTACTTGTC CACCCGCAAG CCCAGCGACC GCCGCACGGT GCTGATTATC GACGAATTTT CATCCCTCGG AACCGAGAGC GTGATCCGGT TGGCGGAAAT GGCCCGCAGT TTGGGCGGGA TCGTGATGCT CGGAACCCAA ACACTGGCGG GCCTTGGTGA TGCCGACCAA CAGGCGCGGA TTGTTGGCAA TATGACCGTG GTGTTACACC GCATGAGTGC CCCCGAAGAA CTGACCAAGC TCGCGGGTGT TCAAAAGGTG ATGACGACGA TTCACCAGTT TCAGGGCAAG CAAATTTTGA AGCGCGGAAC CTACCGCATG GAGGAGGAGG CGCGGATCGA CCCACAGGAT GTCAGAACCT TGCCCACAGG CTGCGCGTGG GTGATTGCCC GTGGGGCAGC GGCCAAAGTC CAAATCGCGA TGATGCCCGC GGTGCCGCAT GTGCCGATTG TCATCCAGCG ACCCCGCCGA GCGCCGACCC CTGCCCAGGC ATTTGCGCCT GTTCCGGAGG ACGCGGCCAG TTTTGGAGCC ACGGTTACCC CAATGGCGGA ACCGCCCAAC 
CCCCAGCCGA TGATGGATCA CCACGACGAC CCCGCCGTGA TAGAGGAGGA AGCCCATGCC CACGACGCTG ACGAACGCTT CAGCTTTGGT GCGGGTCGCG TTGACCCCGC GTGA
|
Protein sequence | MSQPPAAQPA SSGDGGGAMM GFFMGLIVVI VALPAAIGAL WGRRYMKTHQ PMVAMLALAG AGLMALVVWS PLQDHLKESR TAMEREKRRD GIMGVVSAGA GAIPLLWLYT IPLAPALTMV WESVRPKSLA EQQADKDAQA QEKRTLQLDS AKRRALKASA ASLTHRPVTK EIDSATVLGA KIQGDPLFFA NEHKRLLYIT TSIGAASLHM LFIGENGSGK TISMLRFAAS IAASTNWDIF FINPKNDAKT MQEFYDVMAF YDRQCRLFPQ EAYNGWEGDS GALLSRIMAI PAYATEGAAS FYADMSEVYL RAVLNTDEAL PSSFEELEER LQYGRLADQY KNNAQGFARV ASISAADAKS VFMRFATMTP KLTQIRRDGW RLSEARAAYF GLPVLANERD SQSIAKFLLE DIKHYLSTRK PSDRRTVLII DEFSSLGTES VIRLAEMARS LGGIVMLGTQ TLAGLGDADQ QARIVGNMTV VLHRMSAPEE LTKLAGVQKV MTTIHQFQGK QILKRGTYRM EEEARIDPQD VRTLPTGCAW VIARGAAAKV QIAMMPAVPH VPIVIQRPRR APTPAQAFAP VPEDAASFGA TVTPMAEPPN PQPMMDHHDD PAVIEEEAHA HDADERFSFG AGRVDPA
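
The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a field could be normalized and how a GC fraction like the one reported in the table might be computed, the helpers below strip the grouping and count G and C bases. The function names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not on the full gene.

```python
def ungroup(seq_field: str) -> str:
    """Remove all whitespace from a grouped sequence field and uppercase it."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, as displayed in the record.
prefix = ungroup("ATGAGTCAAC CACCAGCAGC")
prefix_gc = gc_fraction(prefix)
```

Applying `gc_fraction` to the complete ungrouped gene sequence would give the value to compare against the GC content field; the two-block prefix here is illustrative only.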