Gene Information |
Locus tag | Haur_2032 |
Symbol | |
ID | 5733921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2528551 |
End bp | 2529453 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641279176 |
Product | hypothetical protein |
Protein accession | YP_001544803 |
Protein GI | 159898556 |
COG category | [R] General function prediction only |
COG ID | [COG0384] Predicted epimerase, PhzC/PhzF homolog |
TIGRFAM ID | [TIGR00654] phenazine biosynthesis protein PhzF family |
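The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2528551
end_bp = 2529453
stated_gene_length = 903      # bp
stated_protein_length = 300   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop
# codon, which is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 903 0 300
```

Under these assumptions the computed values agree with the stated Gene Length (903 bp) and Protein Length (300 aa).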
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.48523 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACAGG TTAACGTGTA TCAAGTTGAT GCCTTCACTC AGCATAAATT TCGCGGAAAT CCTGCTGGGG TTGTGACACA TGCTGATGAT CTGAGCACCC ACGAGATGCA ACAACTGGCC CGCGAACTGA ATAACTCCGA AACGGTCTTT TTGCTTGCGC CGACTGATTC AAGTCATGCG GTCTGGCTAC GATTTTTTAC CCCTACCGTC GAAGTTCCAT TGTGTGGTCA TGCGACGATT GCGGCTCACT ATGTGCGTGC CCATGAGCTG CAATTAGGCT CATGCCAACT CCGACATAAA ACTGGAGCCG GGATACTGCC AATTGAAGTT ATCCAAGCCA ATAATGATTA TCAAATTATT ATGACTCAAG CTGCAATTGA GTTTGGCCCG ATCATTGATG GACAATCCAA ACAGGCTTTA CTTGATGCGC TTGGTCTGAC TATTCACGAC CTGCATTACG CATGCCCGAT TCAAATAGTC TCAACGGGGC ATTCCAAAGT ATTAATTGGG ATTAACCAGC GCACAAAGCT TGATCAACTG CGCCCGAATC TTGCAGCCCT TACCCTGCTG AGTAAACAAA TTGGCTGCAA TGGCTATTTT GTCTATACAT TAGACTCTGA TGATCCGGCG ATTCTTTCCT ATGGGCGGAT GTTTTCACCA GCGATTGGGA TCAATGAAGA TCCGGTTACG GGGAATGCAA ATGGGCCACT CGGAGCCTAC TTAGTGCATC ATCGATTGGT TGCCCACAAC GGAATCCATT TGCGGTTTAC TGCATGTCAA GGTGTTGCGA TTGGGAGGAC TGGGTATATT GAGGTTCAGG TGTTGATTGA AGCTGATGAG CCTGTTTTGG TTAAAGTTGG GGGTCATGCA GTGATTGTTT TTCAAACGAG CATTCAGTTG TAA
Protein sequence | MKQVNVYQVD AFTQHKFRGN PAGVVTHADD LSTHEMQQLA RELNNSETVF LLAPTDSSHA VWLRFFTPTV EVPLCGHATI AAHYVRAHEL QLGSCQLRHK TGAGILPIEV IQANNDYQII MTQAAIEFGP IIDGQSKQAL LDALGLTIHD LHYACPIQIV STGHSKVLIG INQRTKLDQL RPNLAALTLL SKQIGCNGYF VYTLDSDDPA ILSYGRMFSP AIGINEDPVT GNANGPLGAY LVHHRLVAHN GIHLRFTACQ GVAIGRTGYI EVQVLIEADE PVLVKVGGHA VIVFQTSIQL
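The gene and protein sequences above are related by translation, and the GC content field is derived from the gene sequence. The following is a minimal sketch of both calculations, not the method the database itself uses. It assumes the space-separated 10-character blocks are purely a display convention, and it uses the standard sense-codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGAAACAGG TTAACGTGTA TCAAGTTGAT"
print(translate(prefix))  # MKQVNVYQVD
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the stated GC content and protein sequence to be checked in the same way.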