Gene Information |
Locus tag | Haur_1041 |
Symbol | |
ID | 5732945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1187599 |
End bp | 1189644 |
Gene Length | 2046 bp |
Protein Length | 681 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278176 |
Product | hypothetical protein |
Protein accession | YP_001543817 |
Protein GI | 159897570 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1331] Highly conserved protein containing a thioredoxin domain |
TIGRFAM ID | |
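
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines of Python. The sketch assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a stop codon; neither is stated in the record, but both agree with the listed numbers. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 1187599
END_BP = 1189644
GENE_LENGTH_BP = 2046
PROTEIN_LENGTH_AA = 681


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 2046
    print(protein_length(GENE_LENGTH_BP))  # 681
```

Under these assumptions, 1189644 - 1187599 + 1 = 2046 bp, and 2046 / 3 = 682 codons, of which one is the stop codon, leaving 681 amino acids.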
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAATC GTTTAATTCA TGAAACTAGC CCTTACTTGT TGCAACACGC CGAAAACCCT GTCGATTGGT ATGCTTGGGG TGAAGAGGCT TTGCAACGCG CCAAACAAGA TGATAAACCA ATTTTATTAA GTGTTGGCTA TAGCGCATGC CATTGGTGTC ACGTTATGGC TCATGAATCG TTTGAAGATC CAGCGACTGC TGCGGTTATG AACGAATTAT TTGTCAATAT CAAGGTTGAT CGTGAGGAAC GACCTGATAT TGATTCACTG TATATGGCCG CTGTGCAAGC CATGACTCGC CATGGCGGCT GGCCGATGAC GGTCTTTTTA ACGCCTGATG GCGCACCGTT TTATGGTGGG ACCTACTTCC CCCCCGAGCC GCGCCACAAT ATGCCTTCGT TCCAACAGGT GCTACATGGC GTGGCCGAAG CTTACCGCGA CCGTCGCGAA GAAGTGTTTC AGAGCGCCGA GCAGATGCGC GAGCATTTAG AAGATATTTT GAGCTTCGAT CTTGAGCAGG TGAAGCTGAG CAAAAGCCAA TTGAATGTGG CTGCTCAACG CCAAATGAGC CAATTCGATT CGCGCTTTGG TGGCTATGGC GGTGCGCCGA AATTTCCGCA AGCCTTGATT TTTGGCATGG TTTTGCGTAC ATGGCTGCGC AGCGAGGATC AAGATGCGCT TAATCAAGTG ACCCAAACCT TGCAAGCCAT GGCCAACGGT GGCATGTACG ATCAGCTTGG CGGTGGCTTT GCACGATATT CGGTCGATGC TCAGTGGCTC GTGCCGCACT TCGAGAAAAT GCTCTACGAT AATGCTTTGC TCAGCCAGCT CTATCTCGAA ACCTACCAAG CCACCCACGA TCCGTTTTAT CGCCGAATTG CTGAGGAAAG CATCAACTAC ATTTTGCGCG ATATGACGAG TCCCGATGGC GGTTTTTATG CTGCCGAAGA TGCTGATAGC GAAGGCGAAG AGGGCAAGTT TTATGTTTGG AGCTTAGCTG AAATTCAGCA ATTGCTCAGC CCTGAGGATG CGGCCCTTGC CCAGTTGTAT TGGAATATTC AGCCCGAAGG CAATTTTGAG GGCCATGCGA TTTTGTATGT GCCCCAAGAT CCCAGTGTGG TTGCCAAAGA GTTGAGCATT AGCGAGGCAG ATTTGGCCCA GCGGATTGCC GTAATTCGTG CTACGCTCTT GGCCCAGCGT AATACCCGCA TTCGCCCAGG CCGCGATGAA AAGATTTTGG CCTCGTGGAA TGGCATGATG CTGCGCAGTT TGGCCTTTGC TGCCAATGTG CTCGATAACG CCGATTATCG CGCTGCGGCG ATTCGCAACG CTGAATTTAT TACCAGCAAG CTGTATCAAA ACGGCCAACT GTATCGCTCC TATAAAGATG GTCAAGCCAA ATTCAAGGGT TACCTCGAAG ATTATGCCTG TGTTGCCGAT GGAATGCTGG CCTTGTACGA GGCAACGTTT GATCTGCGCT GGTTGCAAGT GGCGATTGAA TTGGCCGAAA GCATGACTGA GCGCTTCTGG GATGCGCAAC AACGCAGCTT TTTCGATACG GCCAGCGATC ATGAACAGTT GATCACACGG CCCCGCGACC TTTACGACAA TGCTACGCCT GCCGGTAATT CGGTGGCGGT TGATGTGTTG CTGCGTTTGG CAACCCTGCT TGATCGCTAC GAATATCGCC AATATGCTGA AACGGTGTTG GCGAATTTGA GCGGTGCGTT GCTCCAACTG CCTGGGGCAT TTGGGCGCTT GCTGGCTGCC GCCGATTTTG CGCTTGCTGA GCCACGCGAA 
GTTGCCTTAA TTGGCGATCC AGCTGATCCT GCGTTCAAAG CGTTGTTGCA AGCGACCTAT CGCAACTACC AGCCCAACAA AGTCGTGGCT GCTTGCAAGC CCGATGATCA CGCGGCTCAG CAGCTAATTC CATTGTTGGC TGAACGACCG TTGCTCAACC AACAAGCCAC GGCGTATGTG TGTGTGCGGC GGGCGTGCAA GTTGCCAACC AACGATCCAA ATGAATTAAT CAAACAATTA GGCTAA
|
Protein sequence | MANRLIHETS PYLLQHAENP VDWYAWGEEA LQRAKQDDKP ILLSVGYSAC HWCHVMAHES FEDPATAAVM NELFVNIKVD REERPDIDSL YMAAVQAMTR HGGWPMTVFL TPDGAPFYGG TYFPPEPRHN MPSFQQVLHG VAEAYRDRRE EVFQSAEQMR EHLEDILSFD LEQVKLSKSQ LNVAAQRQMS QFDSRFGGYG GAPKFPQALI FGMVLRTWLR SEDQDALNQV TQTLQAMANG GMYDQLGGGF ARYSVDAQWL VPHFEKMLYD NALLSQLYLE TYQATHDPFY RRIAEESINY ILRDMTSPDG GFYAAEDADS EGEEGKFYVW SLAEIQQLLS PEDAALAQLY WNIQPEGNFE GHAILYVPQD PSVVAKELSI SEADLAQRIA VIRATLLAQR NTRIRPGRDE KILASWNGMM LRSLAFAANV LDNADYRAAA IRNAEFITSK LYQNGQLYRS YKDGQAKFKG YLEDYACVAD GMLALYEATF DLRWLQVAIE LAESMTERFW DAQQRSFFDT ASDHEQLITR PRDLYDNATP AGNSVAVDVL LRLATLLDRY EYRQYAETVL ANLSGALLQL PGAFGRLLAA ADFALAEPRE VALIGDPADP AFKALLQATY RNYQPNKVVA ACKPDDHAAQ QLIPLLAERP LLNQQATAYV CVRRACKLPT NDPNELIKQL G
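
The gene and protein sequences above are printed in blocks of ten characters. Although the record lists the strand as "-", the gene sequence as printed begins with ATG and ends with TAA, so it appears to be given in coding orientation already. The following is a minimal Python sketch, not the database's own method, for stripping the block spacing, computing GC content, and translating the CDS. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the table's alternative start codons are not handled, which does not matter here because the gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    prefix = "ATGGCAAATC GTTTAATTCA TGAAACTAGC"
    print(translate(prefix))  # MANRLIHETS
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two fields agree; `gc_fraction` on the same input can be compared with the GC content field.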
|
| |