Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1710 |
Symbol | |
ID | 4072055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2075860 |
End bp | 2077428 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637983718 |
Product | hypothetical protein |
Protein accession | YP_590785 |
Protein GI | 94968737 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.636106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.107463 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAACT TCTCCCGCTC TTCCCTAGCC GCGCTCCTGA TCGTCGCCGG CCTCTCGCTC ACCGGATGCA ACAAACAAAG CGCTCCGACG GCCAACGCAG CAGCACCGCA GCAACAACAA GCCGCTCAGC CCGACCAGTC GCAGGCTGCT CAGTCACAAA ATCCTGAGGA CAACGGCAAT CTCCCGCCAG TCGACGCCAA CGGCAACCCC ACCGACCAAC CCACGAGCGA CCAGCAGAGC TATCCTGCTC AGGATCAAAG CGCTAGCCAG CAACAACCGG CGCAGAACCA GGGCAATGCC AGTCAGCAAC AGCAGTATCC CGACCAGGGT TCTCAGCCAG CCCAGGCGCA AGCTCCCGCC TCGCAGAGCT ATCCCGACAA CAGCAACAAT GTTGGATACG GCGACAACAA TCAGGATTAC GACCAGGACC TGTCCCAACA AGACAGCAGC TATGGCCAGC CTGCAATTCA GGCCCATCAA GCGCCGCCCC CAATTCCCGA GTACCAGCAG CCCATGTGTC CCAGTCCCGG CTACGTCTGG ACTCCCGGGT ACTGGAGCTA TGCTCCTGCC GGATACTACT GGGTGCCCGG CGCCTGGGCG CGCCCGCCGC AAGTCGGCTT CCTCTGGACG CCCGGCTATT GGGGTTTCGG CGGCGGAGTC TATCGCTTCC ACTATGGCTA TTGGGGACAC TACGTAGGCT GGTACGGTGG CATTAACTAC GGCTTTGGAT ACGTCGGATC CGGCTACCAC GGTGGTTACT GGCACGGCAA CAACTTCTAC TACAACCGTT CCGTCAACAA CGTGAATGTC ACCAATATCA CCAACGTCTA CAACAAGACG GTCATCGTCA ATAACAACAA CCGCGTCAGC TACAACGGCC CCGGCGGCAT CACCCGCCGT CCCACGCGCG CAGAAGCAGT TGCTGTCCGT CAGCAGCGTA TCCCGCCGAT GACCACGCAG ATCGAGAACC AGCACAATGC CATGCGCGAT CGCCAACAGT TCGCGTCCGT CAACAAAGGA CGCCCCGCGA TTGCCGCTGC TCCCAGACCG ATCGAGGCGG CAAAGCCGGT CGCGCCCGCG ATCGCCGCAC GCCCGGTTCC GCGACCAGCT GCCGGCGCCA GGCCGAACCA ACCAAACAAC GTCGCACGTC CAACTCCGCA GCCAAGTACG CGTCCCACGC CCGTTTCTCC TGCTCGTCCC GAAGCGCGTC CTGTTCCGCG GCCCACCACC ACTCAACCGA GCGTGAAACC AACACCTCAG CCGAGCACGA GGCCCACTCC GCAACCTTCC ACCCGTCCAA CGCCGCAACC GAACACGCAC CCGGTTCCGC AACCAAAGCC CGCGACGCGA CCGACGCCGC AACCTTCCAC GAGGCCGACG CCTCAGCCGA ACACGCGGCC TACTCCGCAA CCAAAGCCGC CGACACATCA GGCACAGCCC AGCACACGCC CTGCGCCACA GCCCCACCCC GGGACGCAAC CGCCAGCCAA GCCTGCAACG CGGCAGGCGC CACAACAACA GAGCAGGCCT TCCAAAGACT CGAAGCCAGA TCGGCCCGAA CACCGATAG
|
Protein sequence | MLNFSRSSLA ALLIVAGLSL TGCNKQSAPT ANAAAPQQQQ AAQPDQSQAA QSQNPEDNGN LPPVDANGNP TDQPTSDQQS YPAQDQSASQ QQPAQNQGNA SQQQQYPDQG SQPAQAQAPA SQSYPDNSNN VGYGDNNQDY DQDLSQQDSS YGQPAIQAHQ APPPIPEYQQ PMCPSPGYVW TPGYWSYAPA GYYWVPGAWA RPPQVGFLWT PGYWGFGGGV YRFHYGYWGH YVGWYGGINY GFGYVGSGYH GGYWHGNNFY YNRSVNNVNV TNITNVYNKT VIVNNNNRVS YNGPGGITRR PTRAEAVAVR QQRIPPMTTQ IENQHNAMRD RQQFASVNKG RPAIAAAPRP IEAAKPVAPA IAARPVPRPA AGARPNQPNN VARPTPQPST RPTPVSPARP EARPVPRPTT TQPSVKPTPQ PSTRPTPQPS TRPTPQPNTH PVPQPKPATR PTPQPSTRPT PQPNTRPTPQ PKPPTHQAQP STRPAPQPHP GTQPPAKPAT RQAPQQQSRP SKDSKPDRPE HR
|
| |