Gene Acid345_4180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4180 
Symbol 
ID4072139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4945185 
End bp4946975 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content58% 
IMG OID637986211 
Productmulticopper oxidase, type 2 
Protein accessionYP_593254 
Protein GI94971206 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.387832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.310383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATAGTGT GTCTCCGAGC GACGCTGGCG TTCTCCGTTA CCCTTTGCTC GCTCTCAGCG 
TTCTCTGCTG CTTCCGATCG TTGTCCTCGG CCTGCAATCG GCGGCGTGGT GCGTGAACCT
GCGGACTTAC GCAGCGTAAA TGGTGTACTC GACGCCAACC TCACAATTCG TAACTCAAAA
GAGCCCGACG GTTCGCTGCG TTATTGCTAC TTTGATGGCA AGGGGAACGA AGCGCCCACC
CTGCGCGTAA GTCCCGGCGA TCGCGTCGTG CTCCATCTGA AGAACGAACT GGTGGATCTT
GAGGGGACGC CTTCTGCGCA CGCTGCGCAT GCACGCGAAG CGGTGACTAA GTCGAAAGAC
CCATGCAGTA GCGCCATCAT GAGCCCGATC GCGACGAACC TGCATTTTCA CGGGCTGACG
ATTCCGCCGA AGTGCCATCA GGATGAGGTG TTGCACACTT CCATTCAGCC GGGCGACCCG
CCATTCACCT ACGAGTTTCA AATTCCGGCA GACGAATCCC CGGGGATGTA TTGGTATCAC
CCGCACATTC ACGGATTCGC GAAGACGGAA CTTCTCGGTG GGGCTTCGGG AGCGATCATC
GTCGAAGGCA CGGAGCGCGC AGACAAAGCT GTCGCGGGAT TAGCTGAGCG TGTGTTCGTG
GTGCGCGACC AGGATTTGCT ATATCCCGAC GCACCTCCGT CGAAGAAAGA GTCGGCTGTT
CCCGCGCTAT TGCTTGATAG CGATGGCGAC ACCGTGAATA CGGGCACAGG GGGCGGCAGA
CCGGCAAAGG ACTTGTCGAT TAATTTTGTG CCGGTGCCGT ATCCGGAGTA CGAACCGGCG
GTGATCGAGA TGAAACCGGG CGAGAAGCAG CTTTGGCGGG TGCTCAATGC TTCGGGCCTG
ACGTATCTCA ACCTCACCTT GTTGCGCGAT GGCTCCGCGC AGGAGGTGGG ACTCATCTCT
CTGGATGGCG TTCCGATGAA CACGAACGGC GGTCCCGCAG ACTTTGTGTA TTGGACGACA
CGGCTCGGTG TGCCTCCGGG ATCGCGAGTG GAATTTATCG TGAGCGGGCC TGCGGCGGGG
ACAACCGAGA TGCTCGTCAC CCGCACGGTG GATACTGGGC CGGGTGGCGA GAACGACCCC
AATCGCGCGC TTGCGGTTCT AAAGCCGGTT GCAAATGCCT CCGAATCTCG GGTAAAGCTT
GACGCAAAAC CACAACCGCT GCCGAGATCG AATAAACAGT GGCTCGGCGA CGTTGCGCCG
GTTCGGGTTC GCAAACTTTA CTTCTCGGAG AAACTACAGG ATCCGAACAA TCCCGCGAGT
GCCGATGAGT TTTACCTGAC GCCAGAGGGT GAGACGGCAC AGATGTTCGA CATGAACGCG
AAGACGCCGA ATATCGTAGT GAAACAGGGC ACAGTGGAAG ACTGGATCAT CGAGAACCGA
TCCAACGAAG TGCACGCGTT CCACATTCAC CAATTGCACT TCCTCTTAAT GGAATCAGGC
GGCGGTCCAG TGGACGAGCC GTACCTGCGA GACACCATTA ATGTTCCGTA CCACCAAGGC
AATCTGCCCG ACTTTCCCGC TATCCGCGTG CGCATGGATT TTCGCGATCC AAACATCGTT
GGGACATTTC TGTACCACTG CCATCTGCTC GATCACGAAG ACAAAGGCAT GATGGGCAGC
ATCGAGGTTC TTCCGGCCGA TTCGCCGCGG CAGGCTAGCA AGAACGGATT GTGCACCGGC
ACCGAGAGTG ATCCGTGCAG CAACTTCAAT ACTCCTCCGC CGACGAAATA G
 
Protein sequence
MIVCLRATLA FSVTLCSLSA FSAASDRCPR PAIGGVVREP ADLRSVNGVL DANLTIRNSK 
EPDGSLRYCY FDGKGNEAPT LRVSPGDRVV LHLKNELVDL EGTPSAHAAH AREAVTKSKD
PCSSAIMSPI ATNLHFHGLT IPPKCHQDEV LHTSIQPGDP PFTYEFQIPA DESPGMYWYH
PHIHGFAKTE LLGGASGAII VEGTERADKA VAGLAERVFV VRDQDLLYPD APPSKKESAV
PALLLDSDGD TVNTGTGGGR PAKDLSINFV PVPYPEYEPA VIEMKPGEKQ LWRVLNASGL
TYLNLTLLRD GSAQEVGLIS LDGVPMNTNG GPADFVYWTT RLGVPPGSRV EFIVSGPAAG
TTEMLVTRTV DTGPGGENDP NRALAVLKPV ANASESRVKL DAKPQPLPRS NKQWLGDVAP
VRVRKLYFSE KLQDPNNPAS ADEFYLTPEG ETAQMFDMNA KTPNIVVKQG TVEDWIIENR
SNEVHAFHIH QLHFLLMESG GGPVDEPYLR DTINVPYHQG NLPDFPAIRV RMDFRDPNIV
GTFLYHCHLL DHEDKGMMGS IEVLPADSPR QASKNGLCTG TESDPCSNFN TPPPTK