Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1081 |
Symbol | |
ID | 5732870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1237412 |
End bp | 1238533 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278219 |
Product | multicopper oxidase type 3 |
Protein accession | YP_001543857 |
Protein GI | 159897610 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2132] Putative multicopper oxidases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000738853 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCGCA ACGAACAACA ACCTATCGAA CAAGTATTAA ATGCCCCAAC CTCGCGCCGC AGCTTCTTGC GCTGGACAGG GATGGCGGCA GCCGCAGGAA CCATGGTCGC ATGTGGCCGC GAAAGTGCGC TAACTGTTGA GCCAGCAACC GTACCACCAG CAAGTGCCAC GACCGCTGCC GCAGGCACCG ATCATAGCAA TGGTGGCCAT AATAGCACTG GCAATGCTGG AACTCCTACC ACCGACAGCG GGCTAAAAGC ATGGGAAGAA ATGGATAAGA TGCATGAAGC CGGGGTTAAG CTGTTCCCAG CTAAAACCGA AGGCCTGAGC ATGCAGCCCC TCGAATATCG CATGGAAGGC GATGTAAAAG TTTTTGAATT GACCTGCGAA AAAACCATGT GGGAAGTTGA ACCAGGCCGC AAACTTGAAG CTTGGACCTA TAATGGTCAA TTGCCTGGAC CAGAAATTCG CGTCACTGAA GGCGACAAAG TGAAGATTTT GGTCACTAAT AACCTTGATG AAAGCACCGC CGTTCACTGG CACGGTTTGT ATGTGCCCAA CAATCAAGAT GGTGTGCCAT TTATCACTCA GCCACCAATC ACACCTGGCT CAACCTATAC CTATGAGTTT ACGGTGCGCA ACTCTGGCTC GCATATGTAT CACTCGCACC ATAACTCGAC CAAACAAGTT TCAATGGGCT TGCTTGGGCC ATTTATCGTT GAGCCAAAAG ATAAGAGCAA AGATCCTGCA TCGGACAAAG AATTTATTTT GGTGCTGAAT GATACCGCCC AAGGTTTCAC GATCAACGGC AAAGGCTTCC CAGCCACCCA ACCATTGACT GCCAAATTGG GGCAAAAAAT TCGCATTCGC TATATGAACG AAGGCTTGAT GATTCACCCA ATGCACTTGC ACGGCTTGCC CCAGTTGGTT TTTGCCAAAG ATGGCTGGAA CTTACCCCAA CCATACATGT GCGATACGCT CAACGTCGCG CCAGGCGAAC GCTGGGATGT AATTGTCGAT TGTACTGACC CAGGTGTCTG GGCCTTCCAC TGCCACATTT TGTCACACGC CGAAAGTGAA CACGGCATGT TTGGGATGGT TACAGCGCTA ATCGTCGAAT AG
|
Protein sequence | MDRNEQQPIE QVLNAPTSRR SFLRWTGMAA AAGTMVACGR ESALTVEPAT VPPASATTAA AGTDHSNGGH NSTGNAGTPT TDSGLKAWEE MDKMHEAGVK LFPAKTEGLS MQPLEYRMEG DVKVFELTCE KTMWEVEPGR KLEAWTYNGQ LPGPEIRVTE GDKVKILVTN NLDESTAVHW HGLYVPNNQD GVPFITQPPI TPGSTYTYEF TVRNSGSHMY HSHHNSTKQV SMGLLGPFIV EPKDKSKDPA SDKEFILVLN DTAQGFTING KGFPATQPLT AKLGQKIRIR YMNEGLMIHP MHLHGLPQLV FAKDGWNLPQ PYMCDTLNVA PGERWDVIVD CTDPGVWAFH CHILSHAESE HGMFGMVTAL IVE
|
| |