Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2064 |
Symbol | |
ID | 4070606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2474450 |
End bp | 2475715 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637984078 |
Product | hypothetical protein |
Protein accession | YP_591139 |
Protein GI | 94969091 |
COG category | [R] General function prediction only |
COG ID | [COG1106] Predicted ATPases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.906065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.239178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAATTC GATACAGCGT AGAGAATTTC AGATCCATCC GAGATCGACA GGAATTGTCC CTTGTAGCCT CAGCGTTGAA GGATCTCCCC AACGCCCTGA TTCGCGTCGA GGACTTTTCG CAATCGCTTT TGCCCGTTGT CGCGATCTAT GGCGCGAACG CTTCAGGAAA AACCAACTTG CTGAAGGCCC TGAGGTTCGT ATGTGACTCG GTCAGAGACT CTCAACGGCT ATGGGCGCCG GCCAATCAGA TTGAGCGACC ACCGTTTCGA ATGGACGAGT TCCGTTCGCG TGCATCCGCC TTTGAGGTTG ACTTCTTAGT TTCGGGTTCT CGGTTTCGTT ATAGCTTTTC GCTCAACGAC GAAGAAATTC TTTCCGAGTC ACTCCAAGCG TTTCCGCAAG GAAAATCGCA GCTGTGGTAT GAGCGCGAGA ACGGGAAATT TACATTTGGC CGGAGTCTCT ATGGAGAGAA CCGCACGATC GAGAGCTTGA CTCGACGAAA CAGCTTGTTT CTATCGGCTG CGGCGCAGAA CAACCACGTG TTACTTACCC CGATTTTCGA TTGGTTCTGC GAACAGGATT TTGTTCTTGG GGGGCGCTCG GTGTCACCGA ATGATCGTGT TGTCTCCCTA TGCAAAGATG ATTCACACAG AGACTTGCTG CTGAAACTGC TCGCTGCGGC GGATTTGGGC GTCACCGGGT ATCGCGTTGA AGAGTCGGAG TGGCCAGATG AAGCGAAGAA AATCTTCGAC CATGTGAAGC AGCTCTTACC CTCCGCGGTC AAGATGCCTG ATAAGTTAGC CACATTGATG TTGGTGCATG GTGCTGGCGA GAACGAGGCC GTCTTTCCCA TCACAGAGGA ATCCTCAGGA ACATCTGCTC TTCTCAAGAT TATCGGCCCA GCGATCAAGT CGCTTGAGCA CGGCAGTGTC CTTTGCGTGG ACGAACTCGA TGCAAGCTTA CATCCGTTGC TCGCGCTTCA CTTGGTCAAG ATGTTCAACG ATTCTGAAAC AAACCCCAAG GGTGCTCAGC TCGTTTTTAA TACGCATGAC ACCAACCTCT TGGATAACGA CGTGTTGAGA CGAGATCAGG TTTGGTTCAC CGAAAAGGAC AGGCGGGGCG CTACTCACTT GTACCCGTTG AGCGACTTCA GGCCCAGGAG GAACGAAAAT CTTGAACGAG GATATTTGCA AGGAAGATTC GGCGCTATTC CTTTTCTCGG AACGGCAAAA TTTCATCCCT TAGGGTCGTC AGAAGACAAT GAATAG
|
Protein sequence | MLIRYSVENF RSIRDRQELS LVASALKDLP NALIRVEDFS QSLLPVVAIY GANASGKTNL LKALRFVCDS VRDSQRLWAP ANQIERPPFR MDEFRSRASA FEVDFLVSGS RFRYSFSLND EEILSESLQA FPQGKSQLWY ERENGKFTFG RSLYGENRTI ESLTRRNSLF LSAAAQNNHV LLTPIFDWFC EQDFVLGGRS VSPNDRVVSL CKDDSHRDLL LKLLAAADLG VTGYRVEESE WPDEAKKIFD HVKQLLPSAV KMPDKLATLM LVHGAGENEA VFPITEESSG TSALLKIIGP AIKSLEHGSV LCVDELDASL HPLLALHLVK MFNDSETNPK GAQLVFNTHD TNLLDNDVLR RDQVWFTEKD RRGATHLYPL SDFRPRRNEN LERGYLQGRF GAIPFLGTAK FHPLGSSEDN E
|
| |