Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3272 |
Symbol | |
ID | 4072684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3875888 |
End bp | 3877537 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637985293 |
Product | hypothetical protein |
Protein accession | YP_592347 |
Protein GI | 94970299 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000123414 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAATGG ATAAAGAAGT GGCTCCCAGC CGTAAGTTAA TCGCAGGGGT GCTGCTCATG ATGCTCGTTG TGTCGCTTTC GGCGCTGGCG CAACAGCAAA CAGCAGCACC GTCTGCCAGC ACCACCACCT CAGCCCCCCA CAACGAATTA TTCGCTGGCT ATTCCTGGTA CGACCCGCGT GGCTATTACA CGGGAGATGT CGCTGGCGTG GCCTTCAAGG CTCCCTCGAT CACACCAGGT TTCGCCGCAG CCTATACGCG TAACTACGGC AACGTATTCG GCTTCACCGT CGATTACAGC GGTCACTTTG GCGATGCCAA CCACGTCAAC ACGTTCCTCG TGGGACCGCA GTTGAAGTGG CGTGCTGAGC ACTTCCAGCC CTTCGGCCAG ATACTGGCTG GCTTGGCTGT CATCAGCGCT CCCAATACCA TTCGCGGAAC GCAGTACCAG GGCGCAGTTG GCGCAGGCGG CGGTTTCGAC CTTCTGCTGA CTGAGAAATT CGGCATTCGC CTCCTGCAAG CCGATTACAT CTACACCAAT TGGGAACAGG GCGCGGCCAC GGGTACTCCG AGCCGTTGGA ACTCCGTACG CCTCCAGGGC GGCCTCCTTT ACCAGTGGGG CTTCAACCCG ACCGTTCCGG TTTCGGCCGC TTGCAGCGCG CAGCCTTCGT CGATCATGGC GGGCGAGCCT GTGAAGGTCA CCGCTACGGG ATCTAACTTC AACCCCAAGA AGACCGTCAG CTACGCGTGG ACGAGCACCG GTGGCAAAGT CAGCGGCACC GACGCAACCA CCACGGTTGA CACCAACGGT CTTGCCCCGG GTACCTACAC CGTGAAAGCG ACGCTCAGCG ATGGCGCGAA GAAGAACCCG AACGTCGCAG AGTGCAACGC GACCTTCACC GTCAACGAAC CGCCGAAGCA TCCGCCGACA ATTTCTTGCA CGGCGAATCC GTCGACCGTT CGCGCGGGTG ACGCTTGCAA CATCGCTTGC AACGGCAACA GCCCGGATGG ACGTCCGTTG ACCTACACCC ACAACGCGAC CGGTGGCCGC CTGACGCCTG ACGGCGCCAA TGCGACCCTC GATACGACGG GTGCAGCAGC GGGTCCGATC ACCGTTAACA GCACCGTGAG CGACGATCGC GGCCTTACCG CCTCGACTTC GTCTTCGTGC AGCGTGGAAG CTCCGCCGGC CGCTCCGACC GCGAGCAAGC TGAACGAAAT CACCTTCCCG AACGAGAAGA AGCCAGCTCG TGTGGACAAC ACCGCCAAGG CGATCCTCGA TGACGTTGCG CTGCGCCTGC AGCGTGAGCC GTCGTCCAAG GCCGTCGTGG TTGGTTACGC CACCGCGGAA GAAACCAAGA AGAAGGCCAA CGCAAACCTC GCCGCACAGC GCGCCGTTAA CACCAAGGCC TGCCTCGACG GTGAAGAGGT ATCTTGCGAG AACCAGAGCA AACAGATCGA CCCGAGCCGC ATTGAAGTTC GCACCGGCAC GGGCGACCAG AACAAGGCCG AGATCTGGAT CGTACCGTCC GGCGCCAGCT TCACCGGCGA AGGCACGACC CCGGTCGACG AAAGCAAGTT CAAGGCGCAG GCTCGTACCG CTGCCGGCGC TAAGGCTGCC AAGAAGGCCC CGAAGAAGGC TGCGAAGTAG
|
Protein sequence | MSMDKEVAPS RKLIAGVLLM MLVVSLSALA QQQTAAPSAS TTTSAPHNEL FAGYSWYDPR GYYTGDVAGV AFKAPSITPG FAAAYTRNYG NVFGFTVDYS GHFGDANHVN TFLVGPQLKW RAEHFQPFGQ ILAGLAVISA PNTIRGTQYQ GAVGAGGGFD LLLTEKFGIR LLQADYIYTN WEQGAATGTP SRWNSVRLQG GLLYQWGFNP TVPVSAACSA QPSSIMAGEP VKVTATGSNF NPKKTVSYAW TSTGGKVSGT DATTTVDTNG LAPGTYTVKA TLSDGAKKNP NVAECNATFT VNEPPKHPPT ISCTANPSTV RAGDACNIAC NGNSPDGRPL TYTHNATGGR LTPDGANATL DTTGAAAGPI TVNSTVSDDR GLTASTSSSC SVEAPPAAPT ASKLNEITFP NEKKPARVDN TAKAILDDVA LRLQREPSSK AVVVGYATAE ETKKKANANL AAQRAVNTKA CLDGEEVSCE NQSKQIDPSR IEVRTGTGDQ NKAEIWIVPS GASFTGEGTT PVDESKFKAQ ARTAAGAKAA KKAPKKAAK
|
| |