Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3907 |
Symbol | |
ID | 4072244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4621969 |
End bp | 4623192 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985933 |
Product | hypothetical protein |
Protein accession | YP_592981 |
Protein GI | 94970933 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0452365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00418649 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCCGGCA TGCCGGACAT GAATATGCAT CCGCATGGCG GCATGTCCGC AAATACATTC GTCGAAGCAA TCGAGAACCA CAGCAGTTCT GGAAGCAGCG TGGAGCCGAT CTCCACTCCA GTGGACATGA TGATGTCCCT GCACAAAGGC TGGATGCTTA TGCTGCACGG CGAACTCTTC CTGAATGCAA TTCAGCAGAG CGGACCACGT GGTGGTGACA AAGTCTTCTC TACCAACTGG ATCATGCCGA TGGCGCAGCG AAGTCTCGGG CCCGGCACCT TGACGGTCCG TACGATGCTT AGCCTGGAGC CGGCGACGGT CACCCAGCGC CGCTATCCTG AGCTTTTCCA AGTGGGCGAA ACGGCGTTCG GCAATTCCAT TGTGGACGGC CAGCATCCGC ACGACTTGTT CATGGAGATC GCCGCACTTT ACGACATCAA ACTTTCGAAA GATGCACTAC TCTCGTTCTA TGCCGCGCCG GTCGGCGATC CAGCGATCGG GCCCATCGCC TATCCCCACC GTATTTCAGC GTCGGAGGAC CCTCTCGCGA CGCTTGGCCA TCACCTCCAG GACTCGACCC ACGTTGCCGC CGACGTATTC ACCGGAGGGT TCACCTACAA GATGGTGCGG CTTGAAGCTT CGGGCTTTCA CGGGCGCGAA CCAGATGAGA ATCGCTGGAA CATCGACCAG GGAACACTCG ATTCATACTC CGCGCGCATC ACCATCGTAC CTGCGAAGAA CTGGTCCGGC CAGTTTTCGG CGGCACATAT CGTGAGCCCA GAAGTTATTG CGCCCAACGA AGACCAGCTC CGCATGACCG CCTCCGTGAG CTACAACCGT CCGCTCCCGC GAGGCAATTG GGCCAGCACT GTTTTATGGG GACGCACGCG CACTCTCGGC CAATCGCAGC CCTTCAATGG ATATCTCGCG GAGAGCACCG TGAAATTTGC GGAGAAGAAC TACGTCTGGG GGCGCATTGA GAACGTAGAC CGCAGCACGG AATTGCTCGA ACTCCCGTCA GCGGGAGAGG GGTTCCTCGC ACGCGTACAG GCCTACACCA CGGGCTACGA GCGGACGTTC CACGTAGTCG ACCGTGCGGA AACAGGACTT GGCGCGCAGG TCACCTTCTA CGCGAAGCCG GATTTTCTCA CGCCGAGCTA CGGCGATCAT CCCACGGGCG TGGTGGCTTT CTTGAAGATC CGTTTGCGCG GCAAAGGGCA GTAG
|
Protein sequence | MPGMPDMNMH PHGGMSANTF VEAIENHSSS GSSVEPISTP VDMMMSLHKG WMLMLHGELF LNAIQQSGPR GGDKVFSTNW IMPMAQRSLG PGTLTVRTML SLEPATVTQR RYPELFQVGE TAFGNSIVDG QHPHDLFMEI AALYDIKLSK DALLSFYAAP VGDPAIGPIA YPHRISASED PLATLGHHLQ DSTHVAADVF TGGFTYKMVR LEASGFHGRE PDENRWNIDQ GTLDSYSARI TIVPAKNWSG QFSAAHIVSP EVIAPNEDQL RMTASVSYNR PLPRGNWAST VLWGRTRTLG QSQPFNGYLA ESTVKFAEKN YVWGRIENVD RSTELLELPS AGEGFLARVQ AYTTGYERTF HVVDRAETGL GAQVTFYAKP DFLTPSYGDH PTGVVAFLKI RLRGKGQ
|
| |