Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3937 |
Symbol | |
ID | 4071320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4656455 |
End bp | 4657885 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985963 |
Product | hypothetical protein |
Protein accession | YP_593011 |
Protein GI | 94970963 |
COG category | [R] General function prediction only |
COG ID | [COG1660] Predicted P-loop-containing kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.93988 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCC TCGCCCTCCT CTTCGAGCAG CACTTCGGCA GTGCACCCAC GCGCATGCAT CCCGTGCAAG GCGGACTCGG CGGCTCCGGA CGCATCATTA CCCGCCTCGC CAACGACACG CATTCGGCCA TCGGCATCCT CAACGAAAAC ACCAAAGAGA ACGCTGCCTT CCTCGAGTTC TCTGACCATT TCCGGAAGTA CGACCTCGCG GTCCCCGAAA TCTATCGCGT AGCCGAGACT CGCAACGCGT ACCTCGAACA AGATCTCGGC GACACTACGC TCTTCCACTT CCTCGCGCGG AACCGTAGCG GGTCTGAGAT CGCTCCCGAG GCCGTCAACG CCTATCGCAA AGTTGTGGAA GCTCTGCCAC GCTTCCAGGT CGTCGCCGGG CGCGACCTCG ACTACTCCGT CTGCTACCCG CGCCCGAGCT TCGACCGCCG CTCCATCGCG TGGGACCTGA ACTATTTCAA GTACTACTTC CTTAAGCTGT CTGAAATTCC GTTCCACGAA GAGGCGCTCG AAGAAGATTT CGACAAGCTC ACCGAATACC TTCTCAGCGC TCGGCGCGAT TACTTCCTTT ACCGCGACTT TCAATCACGC AACGTCATGC TGCACGACGG CCAGCCCTAT TTCCTCGATT ACCAGGGTGG ACGCCACGGC GCGCTGCAGT ACGACATCGC TTCGCTCCTC TTCGACGCGA AAGCCGAGTT GCCACCCGCT TTGCGCGAAG AGCTGCTCAA TCACTATCTC GACGCACTTG CCGAGCACAT CCCTGTCGAT CGACAGGACT TCCTCGCGCA TTACTATCCC TACGTTTACA TCCGCATCAT GCAGGCACTC GGGGCCTACG GCTACCGCGG CTTCTTCGAG CGCAAAGTGC ACTTCCTGCA AAGCGTGCCG TATGCGTTGC AGAACATCCG GTGGCTGCTC CACAACGTAA CGCTGCCGAT CGAATTGCCT GCATTGATGG AAGCCTTCTC CGCCATGCTC GGCTCGGAAA AACTGCAGAA GCTCGCGATC ACCGAGAAGA AGGAGCTCAC GATCGTCGTC ACCAGTTTCT CTTTCCATCG CGGACCGGTG CAGGATGAGA GCGGCAACGG CGGCGGCTTT GTCTTCGACG CCCGTGCCCT CCCCAATCCT GGACGCGAGG AGCAATTCAA GAAGCTCAGT GGCCGCGATG CCGAAGTGAT CGAATATCTT GAGGCCGAAG AATCTGTCAG CCAATACCTC GAGAACGCGA TGAACATGGT CAACGCCAGC GTGCGCGCCT ACAAAAAGCG CCGCTTCACC CACCTGATGG TTTCGTATGG CTGCACCGGC GGCCAGCACC GCTCGGTCTA TCTCGCCGAG CAGACGGCGA AACGACTCGC CGGAATTGAC GGATTAAAAG TCATTCTGCG CCACCGCGAA GAGGAGAGTT GGGTCCGATG A
|
Protein sequence | MDTLALLFEQ HFGSAPTRMH PVQGGLGGSG RIITRLANDT HSAIGILNEN TKENAAFLEF SDHFRKYDLA VPEIYRVAET RNAYLEQDLG DTTLFHFLAR NRSGSEIAPE AVNAYRKVVE ALPRFQVVAG RDLDYSVCYP RPSFDRRSIA WDLNYFKYYF LKLSEIPFHE EALEEDFDKL TEYLLSARRD YFLYRDFQSR NVMLHDGQPY FLDYQGGRHG ALQYDIASLL FDAKAELPPA LREELLNHYL DALAEHIPVD RQDFLAHYYP YVYIRIMQAL GAYGYRGFFE RKVHFLQSVP YALQNIRWLL HNVTLPIELP ALMEAFSAML GSEKLQKLAI TEKKELTIVV TSFSFHRGPV QDESGNGGGF VFDARALPNP GREEQFKKLS GRDAEVIEYL EAEESVSQYL ENAMNMVNAS VRAYKKRRFT HLMVSYGCTG GQHRSVYLAE QTAKRLAGID GLKVILRHRE EESWVR
|
| |