Gene Acid345_1633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1633 
Symbol 
ID4072520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1981254 
End bp1983158 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content57% 
IMG OID637983642 
Producthypothetical protein 
Protein accessionYP_590709 
Protein GI94968661 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00578112 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.776269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACA AAAAGCGGGA CTGCGCCCTT GGCATGGACC GCAGCATTAC GCGCCGCGAC 
TTCCTGAACG GCGTGGCCCT CACCGTCGGT GGCGCACTGG TCGCGCCGAA TCTCCTCAAC
GCAACCGAAA AAGGTTCCAG TTCCGAGTAT TACCCTCCGG CGCTGATGGG CTTGAGGGGT
AATCACGAGG GGACCTACAC CTACGCTCAC GAACTTCGAG ATGGTGTCTT TCAGGAAAGC
GCGCAGCCGC TAAAGACAGA TGAAGACTAC GATCTCGTGA TCGTTGGCGG CGGCATCAGC
GGCCTCGCGG CAGCACATCT CTATCGCAAG AAGGCCGGCA AGAACGCAAA AATCCTGATT
CTCGACAATC ATGACGACTT CGGAGGACAT GCGAAGCGCA ATGAATTTCG CGCCGCGAAT
CGAATGCTAC TCGGCTACGG TGGAACCCAG TCAATCGAGA GCCCGTCAGA GTACAGTCCG
GCTGCCAAGC AGGTCTTGAA AGATCTCGGC ATCGAGACCA AACGGTTCTA CAAGGATTAC
GACCAGAAGC TTTATTCCCA CCTCGGCACA GCGAATTTCT TCGACAAAGA AACTTTTGGG
CAGGACAAGC TCGTTACCGC GATGTTCGAA ACCCCTTGGC AGGAGTGGGT GAAGCAAACT
CCCCTCTCGG AAGCCGCGAA GCGTGATATC GCGCGCGTTT ATACCGAGAA GGTCGACTAC
CTTCCGGGTC TGAGCCCGAA AGAGAAACGT GCGAAGCTCG CCAAAATCAG CTACGCCGAT
TACCTCACGA AATATGCGAA GTGCACGCCG GAAGTGTTGC CGTTTTTCCA GTCGCGAACT
AACGATTTGT TCTGCGTGAA CATCGATGCC GTACCCACGC TCGCGATCCT CGAAGCCGGT
GATGATTATG GCATCCCCTA CGCGGGGCTC GACGGCCTCG GCTTCGGCAA TCAAAGCCAA
GGGCGAAGCG AGCATAAGCA AGAGCCGTAC ATCTTCCACT TCCCCGACGG CAACGCCTCT
GTCGCGCGTC TGCTCGTGCG TGCCCTCATG CCGGGTGCGA TCCCCGGCGA CAGCATGGAG
GACGTCGTCA CCGCCAAGGC CGATTACAGC ACTCTCGACC GCGCCGACTC ACCCGTACGC
ATTCGCCTCA ACAGCACCGT GGTCCGGGCG AAACACGTTG GCGACGTGGC CACATCGAAG
CAAGTTGAAG TGCAGTACAT GCGAGACGGG AAGTTGCAGA GCGTTACCGG CAAAGCGTGC
ATCATGGCTT GTTACAACAT GATGGTTCCT TATCTCTGCC CTGAATTACC GCAGGTGCAG
AAAGATGCGC TCGCCGAGGG CGTAAAAGCT CCCCTCGTCT ACACTCACGT GGCAATACGC
AATTGGGATA TCTTCGACAA ACTGAAGATG TGGCAAGTCT GCTGTCCCGG CAGCTACCAC
GTATATGTCG CGCTCGATTT TCCAGTTAGC ATCGGCGAAT ACAAATTCCC GAGCAAACCC
AGCGAGCCCA TGGTGCTCTT CATGTTGCGC ACGCCTTGCA AGCCGGGCCT ATCGCAGAAG
GACCAGTATC GAGCTGGTCG CATGGAGCTT TTCACGACGC CATACGAGAC GTTCGAGCGT
AATATTCGCG AGCAGCTATC GCGGATGTTC GGCCCTTATG GGTTCGATTC CGCCCGCGAT
ATTGAAGGCA TTACGGTGAA CCGGTGGGCG CATGGCTACG CCTATGGCTA CAACTCGCTC
TTCGATCCGG ATGTTCCTGA AGATCAGCGT CCACACATCA TTGGGCGCAA ACAGTTCGGC
CGGATTTCGA TTGCGAACTC TGATGCCGCG GCGACCGCTT ACACCGATGC CGCCATTGAC
ATGGCTGATC GCGCAGTCAA AGAAGTGCTC GCGTTGAAGA GTTAG
 
Protein sequence
MNDKKRDCAL GMDRSITRRD FLNGVALTVG GALVAPNLLN ATEKGSSSEY YPPALMGLRG 
NHEGTYTYAH ELRDGVFQES AQPLKTDEDY DLVIVGGGIS GLAAAHLYRK KAGKNAKILI
LDNHDDFGGH AKRNEFRAAN RMLLGYGGTQ SIESPSEYSP AAKQVLKDLG IETKRFYKDY
DQKLYSHLGT ANFFDKETFG QDKLVTAMFE TPWQEWVKQT PLSEAAKRDI ARVYTEKVDY
LPGLSPKEKR AKLAKISYAD YLTKYAKCTP EVLPFFQSRT NDLFCVNIDA VPTLAILEAG
DDYGIPYAGL DGLGFGNQSQ GRSEHKQEPY IFHFPDGNAS VARLLVRALM PGAIPGDSME
DVVTAKADYS TLDRADSPVR IRLNSTVVRA KHVGDVATSK QVEVQYMRDG KLQSVTGKAC
IMACYNMMVP YLCPELPQVQ KDALAEGVKA PLVYTHVAIR NWDIFDKLKM WQVCCPGSYH
VYVALDFPVS IGEYKFPSKP SEPMVLFMLR TPCKPGLSQK DQYRAGRMEL FTTPYETFER
NIREQLSRMF GPYGFDSARD IEGITVNRWA HGYAYGYNSL FDPDVPEDQR PHIIGRKQFG
RISIANSDAA ATAYTDAAID MADRAVKEVL ALKS