Gene Haur_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1081 
Symbol 
ID5732870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1237412 
End bp1238533 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content50% 
IMG OID641278219 
Productmulticopper oxidase type 3 
Protein accessionYP_001543857 
Protein GI159897610 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000738853 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCGCA ACGAACAACA ACCTATCGAA CAAGTATTAA ATGCCCCAAC CTCGCGCCGC 
AGCTTCTTGC GCTGGACAGG GATGGCGGCA GCCGCAGGAA CCATGGTCGC ATGTGGCCGC
GAAAGTGCGC TAACTGTTGA GCCAGCAACC GTACCACCAG CAAGTGCCAC GACCGCTGCC
GCAGGCACCG ATCATAGCAA TGGTGGCCAT AATAGCACTG GCAATGCTGG AACTCCTACC
ACCGACAGCG GGCTAAAAGC ATGGGAAGAA ATGGATAAGA TGCATGAAGC CGGGGTTAAG
CTGTTCCCAG CTAAAACCGA AGGCCTGAGC ATGCAGCCCC TCGAATATCG CATGGAAGGC
GATGTAAAAG TTTTTGAATT GACCTGCGAA AAAACCATGT GGGAAGTTGA ACCAGGCCGC
AAACTTGAAG CTTGGACCTA TAATGGTCAA TTGCCTGGAC CAGAAATTCG CGTCACTGAA
GGCGACAAAG TGAAGATTTT GGTCACTAAT AACCTTGATG AAAGCACCGC CGTTCACTGG
CACGGTTTGT ATGTGCCCAA CAATCAAGAT GGTGTGCCAT TTATCACTCA GCCACCAATC
ACACCTGGCT CAACCTATAC CTATGAGTTT ACGGTGCGCA ACTCTGGCTC GCATATGTAT
CACTCGCACC ATAACTCGAC CAAACAAGTT TCAATGGGCT TGCTTGGGCC ATTTATCGTT
GAGCCAAAAG ATAAGAGCAA AGATCCTGCA TCGGACAAAG AATTTATTTT GGTGCTGAAT
GATACCGCCC AAGGTTTCAC GATCAACGGC AAAGGCTTCC CAGCCACCCA ACCATTGACT
GCCAAATTGG GGCAAAAAAT TCGCATTCGC TATATGAACG AAGGCTTGAT GATTCACCCA
ATGCACTTGC ACGGCTTGCC CCAGTTGGTT TTTGCCAAAG ATGGCTGGAA CTTACCCCAA
CCATACATGT GCGATACGCT CAACGTCGCG CCAGGCGAAC GCTGGGATGT AATTGTCGAT
TGTACTGACC CAGGTGTCTG GGCCTTCCAC TGCCACATTT TGTCACACGC CGAAAGTGAA
CACGGCATGT TTGGGATGGT TACAGCGCTA ATCGTCGAAT AG
 
Protein sequence
MDRNEQQPIE QVLNAPTSRR SFLRWTGMAA AAGTMVACGR ESALTVEPAT VPPASATTAA 
AGTDHSNGGH NSTGNAGTPT TDSGLKAWEE MDKMHEAGVK LFPAKTEGLS MQPLEYRMEG
DVKVFELTCE KTMWEVEPGR KLEAWTYNGQ LPGPEIRVTE GDKVKILVTN NLDESTAVHW
HGLYVPNNQD GVPFITQPPI TPGSTYTYEF TVRNSGSHMY HSHHNSTKQV SMGLLGPFIV
EPKDKSKDPA SDKEFILVLN DTAQGFTING KGFPATQPLT AKLGQKIRIR YMNEGLMIHP
MHLHGLPQLV FAKDGWNLPQ PYMCDTLNVA PGERWDVIVD CTDPGVWAFH CHILSHAESE
HGMFGMVTAL IVE