Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4077 |
Symbol | |
ID | 4072499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4827118 |
End bp | 4828551 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637986108 |
Product | hypothetical protein |
Protein accession | YP_593151 |
Protein GI | 94971103 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.844566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCCA ATGCAGTTGA TGCTCTCAAG CAAGCCGCGC GCAGCGTAAT TGCGGTCATT TGCGCGTTCT TACTCGTGCC CGGCGATGCA GTGATATGGG CGTCGCCAGC CCCTCAAGAT CAACAGGCGC AAGCCCCGGC GCAGGACGAT GCCGCAGCGA AACTGCCTCC AGACCAACTC GAATCTCTGG TGGCGCCGAT CGCGCTCTAT CCCGATCCGC TGCTTAGCCA GATGCTCGTC GCCTCGACCT ACCCACTGGA GATCATTCAA CTTCAACAGT GGCTCGCGAA GAATAAGGGT TTGAAAGACA AGGCTCTCTC GGATGCCGCA ATGAAGCAGC CCTGGGATGC GAGCGTTCAA GCCATGGCAG TGCTGCCCGA CGCCGTTAAG CAATTGTCCG AGAACATCCA GTGGACAACG GACCTCGGCA ATGCCTTCCT TGCACAGCAG GAAGACGTGA TGAACGCCGT GCAACGCATG CGCTCGAAGG CGAAGGACAA AGGTGCCCTG AACTCGAACG AGCAGATGAA GGTCGAGACG CAGACGGTCG AGAACAAGCA GGTGATCGTG ATCCAGCCGT CGAGCCCGGA CGTCGTGTAC GTGCCGAGCT ACAACCCGAC CGTCGTCTAT GGTGCGCCGG CATATCCGTA TCCTCCGATG TACTATCCGC CCGCACCAGC CGGTGCCTAC CTCGCTACCG CCGCAATTTC GTTCGGTGTT GGACTGGCCG TTGGCGCCGC TTGGGGCGGT GGTGGCTGGG GTTGGGGCGC TGGCTGGGGA CACGGCGACG TCAATGTAAA CGTCAACAAC AACTTTAATC GGAATACCAA CGTTCAGGGC GGCAACCGCA CGAATGTCGG CAACGGCAAC CGCACCAACG CTGGCAACGG CAATCGTGGT GGCGGCGCGA ACGGCGGCAA GTGGCAGCAC AATCCATCCC ACCGCGGCGG CGCTCCTTAC GGTGATCGTG GTACCGCGAA CAAGTTTGGC GGTTCGTCAC GCGGCGATTC CCGCGGACAA GGCGCTGGTA ATCGCGGCGG AGCCAATGCG GGCAATCGCG GCGGAGCGGG CGCAGGAGAT CGCGGCGGCA ATCGCGGCGG AAACAACGCA GGAAATCGTG GCGGAGCTTC CGCCGGCACC TCAGATCGTG GCGGTAATCG CGGCGGCGGC GGCGGGCCTT CAGCGGGCAC GTCGGACCGT GGCGGCAATC GTGGCGGCTC CTCAGCCGGT GGCGGTTCCC GAGGTGGAAG TTCACCGAGC CGAAGCGGTG GCAGCAGCAG CGCCTTCGGT GGTGGCGGCG GGTCTAGAAG CAGTGGCTCT TCGGCACGCG CCAGCAGTTC GCGCGGCTCC TCCAGCATGG GCGGTGGTGG TGGCTCCCGC GGCGGCGGCG GCTCCCGTGG TGGTGGTGGT GGCGGCGGCG GACGGCGGAG ATAA
|
Protein sequence | MKANAVDALK QAARSVIAVI CAFLLVPGDA VIWASPAPQD QQAQAPAQDD AAAKLPPDQL ESLVAPIALY PDPLLSQMLV ASTYPLEIIQ LQQWLAKNKG LKDKALSDAA MKQPWDASVQ AMAVLPDAVK QLSENIQWTT DLGNAFLAQQ EDVMNAVQRM RSKAKDKGAL NSNEQMKVET QTVENKQVIV IQPSSPDVVY VPSYNPTVVY GAPAYPYPPM YYPPAPAGAY LATAAISFGV GLAVGAAWGG GGWGWGAGWG HGDVNVNVNN NFNRNTNVQG GNRTNVGNGN RTNAGNGNRG GGANGGKWQH NPSHRGGAPY GDRGTANKFG GSSRGDSRGQ GAGNRGGANA GNRGGAGAGD RGGNRGGNNA GNRGGASAGT SDRGGNRGGG GGPSAGTSDR GGNRGGSSAG GGSRGGSSPS RSGGSSSAFG GGGGSRSSGS SARASSSRGS SSMGGGGGSR GGGGSRGGGG GGGGRRR
|
| |