Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1684 |
Symbol | |
ID | 4069352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2040831 |
End bp | 2042552 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983692 |
Product | ASPIC/UnbV |
Protein accession | YP_590759 |
Protein GI | 94968711 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.478922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAGC GAATCGGGAA GATCCTGGCC GCGAGTATCT TGGTGTGCGC GGCCTTCGTT GCGACGGCCC AGGCGCAGAT CACCTTCAAG GACATCACCC AGCAGGCGGG AATCCACTTC ACGCACACCA ATGGTGCGAC GGGGAAGAAG TATCTGCCGG AGACGATGGG GCCGGGCTGC GCGTTCCTTG ACTACGACAA CGACGGATAT CCCGATGTGC TGCTGATTAA CGGGAAGACT TGGGCGCCAG GCAGCAGCAG CACGATGAAG CTCTACCACA ACAACCACAA CGGGACGTTT ACCGATGTGA CTGCGAAAGC AGGCTTGTCG GTACCGATGT TCGGGCTCGG GGTTGCGGTG GGCGATTACG ACAACGATGG CTATGACGAT CTTTTCGTTA CTGCACTAGG ACAGAGCCAT CTATTCCACA ACAACGGGAA CGGCACTTTC ACCGACGTGA CGAAGCAGGC AGGGATGCTG GGGCCGAATG AGTTCAGCAC CAGCGCAGCG TGGGTGGACT ACGACAAGGA CGGCAAACTC GATCTTGTCG TCGCGAATTA TGTGCAATGG ACGCCTGAGA CGGACATCTA TTGCACGCTG GATGGCAGCA AGAAGTCGTA CTGCACGCCT GAGGCTTATA AGGGCGCATC GGTGCGGCTC TGGCACAATC TTGGCGGCGG CAAATTCGAA GATGCAACCG CGAAGTCCGG GCTTTTTGAT TCGACTTCGA AGTCGCTTGG CATCGCAGTG GCCGATGTGA ATGGCGACGG CTGGCCGGAC CTCATCGTCG CTAACGACAC GCAGCCGAAC AAACTCTACG TCAATCAGAA GAACGGTAAA TTCCAGGAGA GTGCGGTTTC TGCGGGCATT GCGTTCAGCG AAGATGGTGT TGCGCGCGCT GGGATGGGAG TGGACGCCGC CGACTACGAT CGGTCTGGCA AACCCAGCAT CATCATTGGC AACTTCTCGA ACCAGATGAT GGCGCTGTAT CACAACGAGG GCAACACGCT ATTCGTGGAT GAAGCGCCAC GCTCGGAAGT CGGCCGCAAG AGTTTGCTCA CGCTTGGGTT TGCCTGCTTC TTCTTTGACT ACGATCTCGA CGGTTGGCCC GATATCTTCG TGGCGAATGG ACACATTGAG CCGGAGATCG AGAAGATCCA AAAGCGCGTG AAGTACTCGC AGCCTTCGCA CCTCTTTCAC AACCAGGGGA ATGGACAGTT CACCGAAGTC ACGGGACAGG TGGGAACTGC ACTGGGGACC CCGAAAGTCG CTCGCAGCGC GGCGTATGCC GATATCGACA ACGATGGAGA TCTCGATCTG CTGATCACTA CGAACGGTGG TCCGGCGATG CTGCTTCGCA ATGACGGCGC GACGAACAAG AGCCTGCGCA TTAAGCTCGA CGGGACACGG TCCAATCGCG ACGGCATCGG CGCAGTAGTG ACCGTGCGCG CTGGGAACGA CAAGCAGGCG CAGATGCTGC GCAGCGGTTC CGGCTATCTT TCCGCGAGTG AATTGGTGCT GACATTCGGA CTTGGCCAAC ACGCAACCGC TGATGAGGTT CAGATTACCT GGCCCAGCGG TCAGGTCGAT CATCTCGCGG GTGTTGCTGC AGGACAGACT GTCGTCGTGA AAGAGGGCGG GTTGGTTGAG CAGAAGCGGC CGTACGGCGC TTCTGCAGCA GCTCCGAAGG CGGCACGGGC GAAGATCAAA GGTGGGAAGT GA
|
Protein sequence | MNKRIGKILA ASILVCAAFV ATAQAQITFK DITQQAGIHF THTNGATGKK YLPETMGPGC AFLDYDNDGY PDVLLINGKT WAPGSSSTMK LYHNNHNGTF TDVTAKAGLS VPMFGLGVAV GDYDNDGYDD LFVTALGQSH LFHNNGNGTF TDVTKQAGML GPNEFSTSAA WVDYDKDGKL DLVVANYVQW TPETDIYCTL DGSKKSYCTP EAYKGASVRL WHNLGGGKFE DATAKSGLFD STSKSLGIAV ADVNGDGWPD LIVANDTQPN KLYVNQKNGK FQESAVSAGI AFSEDGVARA GMGVDAADYD RSGKPSIIIG NFSNQMMALY HNEGNTLFVD EAPRSEVGRK SLLTLGFACF FFDYDLDGWP DIFVANGHIE PEIEKIQKRV KYSQPSHLFH NQGNGQFTEV TGQVGTALGT PKVARSAAYA DIDNDGDLDL LITTNGGPAM LLRNDGATNK SLRIKLDGTR SNRDGIGAVV TVRAGNDKQA QMLRSGSGYL SASELVLTFG LGQHATADEV QITWPSGQVD HLAGVAAGQT VVVKEGGLVE QKRPYGASAA APKAARAKIK GGK
|
| |