Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5300 |
Symbol | |
ID | 5737258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 94916 |
End bp | 96163 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641282464 |
Product | hypothetical protein |
Protein accession | YP_001548055 |
Protein GI | 159901810 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.130766 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACC CGCACGTATC GACGATTGAC ACCGCAACGA CCCCTCGCCG TGATCCGTTG ACGGTCTACC AATATATGCT TACAGGCCAA CCACTCAAGG ACAACAATGC CCAATTGCCC CAAACGGTTG AGGTCAAAGG CCAGCCGATG ACGGCGATTG GGATTGATCC GGGCAATGGC GAGATGAAAG CGGCGATGAT GGGCCTTGAT GGGCGGCTTG TCACGGTGCA AATTATTTCG GCCTACCGCA TCGCTGTGAC CCTTGGCGGT GGCAAAAGCC CCACGACCTA TACCGTGAAT GGTGGCCCCT CGTTTTGGAT TGGCCGTGAT GCCGTGCAAA TGAAGGGCGA TGCCTTACCG ATTGGCCCAA CGGCGGTGCG TTTAGAAGAT CCCCGCCAGA TTGACTTCTA TGCTGCGGGT GTCGTTGAAC TCTTGATCAA AGCACACTGC GCTCCTGGGC AATACACCCT CGCAACGGGC TTAGCCTTGC CCAATATGGA GATGCAAGCC CAGGTCAAGA AGAATGAAGC GGGAGAAGAG GTGGAAGTCT TTGGGGTGGT CGAGGAGAGC AAGCAGGCGA TCAAGGAGCA TATCTACGGC AAGAGCTACC ATGTGAGCCG TCTTGACGAA GATGGGGACG TGACCAATTG GCAGATCACC TTTGGCCAAG TCTATACCCA AGCCCAGAGC TATGGCACCT TTATGGCTCT CACGCACACC ATCTTTGGAA CCCGCCGCAC CGATGGCATT CAAGAGTATG CGATTATTGA CATGGGCCGT GGCGACACCC ACGAAACCCT CATCCAGTTA TCACCCACGT TCCGCATGAT GACGAAGCGC ACCGGCGAAG GCACGATCAA GCAAGCACGG GCGGTTGCAC GCGCCTTGGC GGAGTTTGAC TTGAATGATG CCCAAGCCCA AGAAGCCTTG ATCACGCGGA GTATTCTTGA TGGGGGACGG CCCAAATCGA TTAATCATGT CGTCGATAAA GTAGTAGAGC GCGAAACCCA AGAAATGCTC AGTCGCTTGT TACCCGCATT GAGAAATAGA AATGCCTTCA TTGCCTTTAC GGGTGGCGGC ACCAAGGACG CGACAACCTT GCAAATGATT AATGATCGGA TGGACAGCGT GGGCCGCAGC GCCGAGAGTT TTGTGATTGT GCATCCCGAA GTCGCCAGCG TCTTGAACGC GGTGGGAACC TTGTTGAAAG TCTTGTTTAC CGAGTTAGCA CGAAAGGGAC GGGCATAA
|
Protein sequence | MTNPHVSTID TATTPRRDPL TVYQYMLTGQ PLKDNNAQLP QTVEVKGQPM TAIGIDPGNG EMKAAMMGLD GRLVTVQIIS AYRIAVTLGG GKSPTTYTVN GGPSFWIGRD AVQMKGDALP IGPTAVRLED PRQIDFYAAG VVELLIKAHC APGQYTLATG LALPNMEMQA QVKKNEAGEE VEVFGVVEES KQAIKEHIYG KSYHVSRLDE DGDVTNWQIT FGQVYTQAQS YGTFMALTHT IFGTRRTDGI QEYAIIDMGR GDTHETLIQL SPTFRMMTKR TGEGTIKQAR AVARALAEFD LNDAQAQEAL ITRSILDGGR PKSINHVVDK VVERETQEML SRLLPALRNR NAFIAFTGGG TKDATTLQMI NDRMDSVGRS AESFVIVHPE VASVLNAVGT LLKVLFTELA RKGRA
|
| |