Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2751 |
Symbol | |
ID | 4069442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3258554 |
End bp | 3259963 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984768 |
Product | hypothetical protein |
Protein accession | YP_591826 |
Protein GI | 94969778 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCCGC CGGGAATGGC AACAAGGTGC CGGACGCAGT GTTGCGGCCC TGTGAATGAC ACCTTATGCG AGGCAGCGAT ACAACACGGC GTGCGAATGC CGCTTCCGTT CATATACGCG AGGGGGACGT TTGCCGTCCG CCGCGTCAGG AGGGGAACGT CCATGAAGAT TCGTCGCTTG CTTTATCCCG CGATGCTGCT GCTCATCGTG TTGGGCTTGT CGGGGCTCTC GTTCGCACAA TTCGGTGTCT CTATTTCTAT CGGTGTCGCG CCGCCACCAC TGCCGGTCTA TGACCAGCCG CCGATCCCCG CGGTCGGTTA CATCTGGACG CCCGGCTTTT GGGCATGGAG CGATGACGGC TACTACTGGG TTCCGGGAAC ATGGGTGCAG CCTCCCGAAG TCGGCCTGCT TTGGACGCCG GGCTATTGGG GCTGGAATGA TGACCAGTCG GCTTTCGCTT GGAACGATGG CTACTGGGGA CCCGAAGTCG GTTTCTACGG CGGAGTTGAT TACGGCTATG GCTACACTCC AGACGGCTAT TACGGTGGCG AATGGCGAGG CAGGGACTTT TACTACAACC GCGCCGTGAA CAACGTAACC AGCGTTCACA TCACGAATGT TTATGAGAAG CAGGTGGTAA CCCACACAAC CGTAGAGCGC GTGAGTTACA ACGGTCCAGG TGGTGCGCAG GCGAAGCCAA CGCCGAAGCA AGTACAGGCG AAGCAGCAGA AGAAGGTGGA TGTTACGCCC GTACAGAAAC AGCATGTGCA AGCAGCGAAG AGCAATCCAC AACTGCTGGC CAAGAACAAC AATGGCAAGC CGCCTGTAGC AGCCACCTCG AAGCCTGGCG ACATGAGCCA TGCCGTGCAG GCGAAAGCAG CCGGCGGCAA GATCGATCCG AAAGTGCTCC AGGCCAACGC CAAGACTGCG CCGAAGACGG CCAAACCTGC GGCGCGTCCG GAGGCGAACG CGGCGAAACA ACCTGCAAAG CCGGCAGAAA CCGGCAAGGC TGCCGAGACG AGCAAAGCCA CCGAAGCAAA CAAGCCCGCC GCGGAAACCG GCAAGGCCAG CGCAAGCAGC AAGCCGAGCA CGGAGCCTTC GAAGAGCACC GAGCGTACGG CAACGAAGCC TGCGAGTGGC GCTGAAAACA GCGGAGCTCG CGCCAAACAG ACGGAACCAG CTACGCCGCG TCCGGCAGCC AAGGAATCCG GATCGTCGGC CGGAATGCCG CAGTCGGAAG GTGCTGGTCA TCGTGCAGAA CCTGCCGGCA AGGCCGCGGA ACCCAAGGCA CAGCCTGCTG AGCATGCTCC TGCTGCCAAG GAAAAAGCTG CGCAGCCGGA AACGAACAAA CAGAAGCCGG AGAAGGCCAA GCCGGCATCT GAGAAAGACA AGCCGGAAGA ACCAAAATAA
|
Protein sequence | MAPPGMATRC RTQCCGPVND TLCEAAIQHG VRMPLPFIYA RGTFAVRRVR RGTSMKIRRL LYPAMLLLIV LGLSGLSFAQ FGVSISIGVA PPPLPVYDQP PIPAVGYIWT PGFWAWSDDG YYWVPGTWVQ PPEVGLLWTP GYWGWNDDQS AFAWNDGYWG PEVGFYGGVD YGYGYTPDGY YGGEWRGRDF YYNRAVNNVT SVHITNVYEK QVVTHTTVER VSYNGPGGAQ AKPTPKQVQA KQQKKVDVTP VQKQHVQAAK SNPQLLAKNN NGKPPVAATS KPGDMSHAVQ AKAAGGKIDP KVLQANAKTA PKTAKPAARP EANAAKQPAK PAETGKAAET SKATEANKPA AETGKASASS KPSTEPSKST ERTATKPASG AENSGARAKQ TEPATPRPAA KESGSSAGMP QSEGAGHRAE PAGKAAEPKA QPAEHAPAAK EKAAQPETNK QKPEKAKPAS EKDKPEEPK
|
| |