Gene Information |
Locus tag | Acid345_1633 |
Symbol | |
ID | 4072520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1981254 |
End bp | 1983158 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983642 |
Product | hypothetical protein |
Protein accession | YP_590709 |
Protein GI | 94968661 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1232] Protoporphyrinogen oxidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
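
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1981254, 1983158)
print(length)                  # 1905, matching "Gene Length"
print(protein_length(length))  # 634, matching "Protein Length"
```

Because the gene lies on the minus strand, the start and end values describe its position on the replicon, not the direction of transcription.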
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00578112 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.776269 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACGACA AAAAGCGGGA CTGCGCCCTT GGCATGGACC GCAGCATTAC GCGCCGCGAC TTCCTGAACG GCGTGGCCCT CACCGTCGGT GGCGCACTGG TCGCGCCGAA TCTCCTCAAC GCAACCGAAA AAGGTTCCAG TTCCGAGTAT TACCCTCCGG CGCTGATGGG CTTGAGGGGT AATCACGAGG GGACCTACAC CTACGCTCAC GAACTTCGAG ATGGTGTCTT TCAGGAAAGC GCGCAGCCGC TAAAGACAGA TGAAGACTAC GATCTCGTGA TCGTTGGCGG CGGCATCAGC GGCCTCGCGG CAGCACATCT CTATCGCAAG AAGGCCGGCA AGAACGCAAA AATCCTGATT CTCGACAATC ATGACGACTT CGGAGGACAT GCGAAGCGCA ATGAATTTCG CGCCGCGAAT CGAATGCTAC TCGGCTACGG TGGAACCCAG TCAATCGAGA GCCCGTCAGA GTACAGTCCG GCTGCCAAGC AGGTCTTGAA AGATCTCGGC ATCGAGACCA AACGGTTCTA CAAGGATTAC GACCAGAAGC TTTATTCCCA CCTCGGCACA GCGAATTTCT TCGACAAAGA AACTTTTGGG CAGGACAAGC TCGTTACCGC GATGTTCGAA ACCCCTTGGC AGGAGTGGGT GAAGCAAACT CCCCTCTCGG AAGCCGCGAA GCGTGATATC GCGCGCGTTT ATACCGAGAA GGTCGACTAC CTTCCGGGTC TGAGCCCGAA AGAGAAACGT GCGAAGCTCG CCAAAATCAG CTACGCCGAT TACCTCACGA AATATGCGAA GTGCACGCCG GAAGTGTTGC CGTTTTTCCA GTCGCGAACT AACGATTTGT TCTGCGTGAA CATCGATGCC GTACCCACGC TCGCGATCCT CGAAGCCGGT GATGATTATG GCATCCCCTA CGCGGGGCTC GACGGCCTCG GCTTCGGCAA TCAAAGCCAA GGGCGAAGCG AGCATAAGCA AGAGCCGTAC ATCTTCCACT TCCCCGACGG CAACGCCTCT GTCGCGCGTC TGCTCGTGCG TGCCCTCATG CCGGGTGCGA TCCCCGGCGA CAGCATGGAG GACGTCGTCA CCGCCAAGGC CGATTACAGC ACTCTCGACC GCGCCGACTC ACCCGTACGC ATTCGCCTCA ACAGCACCGT GGTCCGGGCG AAACACGTTG GCGACGTGGC CACATCGAAG CAAGTTGAAG TGCAGTACAT GCGAGACGGG AAGTTGCAGA GCGTTACCGG CAAAGCGTGC ATCATGGCTT GTTACAACAT GATGGTTCCT TATCTCTGCC CTGAATTACC GCAGGTGCAG AAAGATGCGC TCGCCGAGGG CGTAAAAGCT CCCCTCGTCT ACACTCACGT GGCAATACGC AATTGGGATA TCTTCGACAA ACTGAAGATG TGGCAAGTCT GCTGTCCCGG CAGCTACCAC GTATATGTCG CGCTCGATTT TCCAGTTAGC ATCGGCGAAT ACAAATTCCC GAGCAAACCC AGCGAGCCCA TGGTGCTCTT CATGTTGCGC ACGCCTTGCA AGCCGGGCCT ATCGCAGAAG GACCAGTATC GAGCTGGTCG CATGGAGCTT TTCACGACGC CATACGAGAC GTTCGAGCGT AATATTCGCG AGCAGCTATC GCGGATGTTC GGCCCTTATG GGTTCGATTC CGCCCGCGAT ATTGAAGGCA TTACGGTGAA CCGGTGGGCG CATGGCTACG CCTATGGCTA CAACTCGCTC TTCGATCCGG ATGTTCCTGA AGATCAGCGT CCACACATCA TTGGGCGCAA ACAGTTCGGC 
CGGATTTCGA TTGCGAACTC TGATGCCGCG GCGACCGCTT ACACCGATGC CGCCATTGAC ATGGCTGATC GCGCAGTCAA AGAAGTGCTC GCGTTGAAGA GTTAG
|
Protein sequence | MNDKKRDCAL GMDRSITRRD FLNGVALTVG GALVAPNLLN ATEKGSSSEY YPPALMGLRG NHEGTYTYAH ELRDGVFQES AQPLKTDEDY DLVIVGGGIS GLAAAHLYRK KAGKNAKILI LDNHDDFGGH AKRNEFRAAN RMLLGYGGTQ SIESPSEYSP AAKQVLKDLG IETKRFYKDY DQKLYSHLGT ANFFDKETFG QDKLVTAMFE TPWQEWVKQT PLSEAAKRDI ARVYTEKVDY LPGLSPKEKR AKLAKISYAD YLTKYAKCTP EVLPFFQSRT NDLFCVNIDA VPTLAILEAG DDYGIPYAGL DGLGFGNQSQ GRSEHKQEPY IFHFPDGNAS VARLLVRALM PGAIPGDSME DVVTAKADYS TLDRADSPVR IRLNSTVVRA KHVGDVATSK QVEVQYMRDG KLQSVTGKAC IMACYNMMVP YLCPELPQVQ KDALAEGVKA PLVYTHVAIR NWDIFDKLKM WQVCCPGSYH VYVALDFPVS IGEYKFPSKP SEPMVLFMLR TPCKPGLSQK DQYRAGRMEL FTTPYETFER NIREQLSRMF GPYGFDSARD IEGITVNRWA HGYAYGYNSL FDPDVPEDQR PHIIGRKQFG RISIANSDAA ATAYTDAAID MADRAVKEVL ALKS
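
The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. The following is a minimal sketch of that cleanup and of a GC-percentage calculation of the kind summarized in the GC content field. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input, so the printed value describes that fragment and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGAACGACA AAAAGCGGGA")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 45.0
```

Passing the full gene sequence through the same two functions would give the gene-wide value to compare against the reported GC content.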