Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0346 |
Symbol | |
ID | 4069588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 377827 |
End bp | 380265 |
Gene Length | 2439 bp |
Protein Length | 812 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982349 |
Product | cell surface glycoprotein |
Protein accession | YP_589425 |
Protein GI | 94967377 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0984059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.138773 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCGC GCCACACATT TCTCCCGGGA GCCGCCATCG TCGCGACTCT ATGCGCCTCG CTTGCCACCT TCGCCGCCAC CCCAAGCTCG CCGGAGCGTT TCAACCGCTA CGGCGACAGT CCGATGTTTT TTGAACCCAA TGTAGGGCAA TCGACCGACG TGGTCCGCTT TATCGCCCAC GGTTCCCGCT ACGGTCTGTT CCTCACGAAC GAAGGCGCCA CGCTCGACTT GTCGCGCGAC GCCCGGCATT CGGTGGCGTT CAAGATGACC TTCTCCGGAG CGGCGCAACC CGGCTCGCTC TCCGCCGAAC AACCGCTCCC CTCCCAATCC AACTACTTCG TCGGCGATAG CAAGAATTGG CACACGGGGG TCGCCAACTT TGCGCGCGTC CGTTATAGCT CTGTGTATCC AGGCATTGAT GCCGTCTTCT ATGGAAACCA GCGGCAACTC GAATACGATT TCGATGTCGC GGCCGGCGCC GATCCTTCGC AGATCGGCCT GCACATCGCG GGCGCGGACA AGCTTTCTCT AACTGCTGAT GGAGCTCTCG AGATCCATGC CGACGGCCGG AGCATCGTGT TTCATGCGCC CATCGCCTAC CAGCAGGTGA ATGGTACCCG TCACAGTGTG GCCAGCCACT TTGTGCTGCG GGGCAAGGAC GAGATCGGCT TTGCCATGGA CCCGTACGAC AAGACACGCT CCCTGGTGAT TGACCCCACG CTGGTGTACT CAAGCTACCT CGGGGGAAGC GGAGATGACG AAGGCGATGC GGTCACCGTG GATGGATACG GCTATACCTA TGTCACGGGA GGAACAGTTT CGCTCGATTT CCCGCACACT GCAGGCGCGT ACCTCAAGGG AAGTGCTTAT CGCATCTTCG TAACGAAAAT AAAACCAGAT GGCACCGGGC TCGTCTATAG TGCGACCATC GGCGGATCAA AAGGTCTGTA TGGAAACGCC GGATTACAAA CAGGCAGAGC CATCGCGGTG GATGATCTCG GCGCAGCTTA CGTCGTTGGC GACGTCGACT CTACCAACTT CCCGATTACG AGTGATGCGA TCGAGCCCGC CAACGCCACC ACCGCCGGAC TTGTGGGCGT TGCGTTCGCT CTCAGTCCCA ACGCCAGCCG CTTGTTGTAC TCCACTTATG TCGGCGGGAA TACCTACGCG TCGCGGGTAA ACGGGGTCGC GCTCGATCGA TCGCGAAACG CTTATATTAC GGGCTTCACG TCGTCTTCCG GATTGCCCGT AACGCCGGGA GCTTTTCAGA CGGTGGCGAA ATCCGGCGAA GAAGCGTTTG CGGCGAAGAT CAATCCCAGC GGCTCGGCCT ACGTCTATGC CACCTACCTC AGCGGCAACG ACACCGACCA GGGCACGGCG ATTGCGGTGG ATGGCAACGG GAGCGCTTAC GTCACCGGAT GGACGGGCTG CGCGACGTTT CCTTCGACGC CGAACGCGTT CCAGCCGACC TGCCAGGGAC CCTATGATGT GTTCGTCGTG AAGCTGAACG CGAGTGGCTC CGCGCTGTCG TATGGGACGT TCCTGCATGC GCCGATTGAC TTCAACCTCG GGCCGACGGG AATTGCGCTC GACGTTCACC GCAATGCTTA CGTGGTGGGC AGGGCCAATG CAGGTCTGCC GACCACGCCC GGCGCTTTTC AAGCATCGGC GCCCGGCAAC GGCGATGGCT TTGTGGTGAA GCTGAATGCG ACCGGCTCAA CCGAACTGTA TGCGACGTAC CTGGGCGGCT CCACCGGCGA CGACTTTGCC ACCGGTGTAA AGGTCGATCT GAGTGGACGG GCCTATCTGA GCGGCCACGT CATTGGAACC ACCGACTTCC CGGTCACCGC CAACGCGTAT CAATCCACGC TGCACACCTT CGGCGGTCCG GTTGCGAGCA CACGCAACGC CTTCCTTACG CGATTGAATG CCAACGGTAC CGGACTCGAC TACTCCACGT ACTTCGGCAC GCGGTATGCC CTCGCCCTGA GCTTGGCCCT CGACCTGAAG AACAATGTGG TTCTTACAGG TCTGACAAGT TATAACGACA TTCCCATCAC GGCGAATGCC TTCGACAAGG TCGCCAGCAA CAACGGGGCA CTGGAGGCAT GGGTGGCGAA GTTCTCGTTC GGGACCACGG GAACCTGTAC GCCTGCGCAG TCGGGGGCGC TTATTTGCTC TCCGGTAGAG GGTGACACCG TTGGCACGAC CGTACCGGTG ACGGCGGGTG CGACCGCCGA GCCCGGGCTT TACATTAAAT CCATACGCTT CTACGTGGAT AATGTCGCGA AAGCGACCGT TTCCACTTCG GGGAACCCCA CCAGTTTCGA AACCTCGAAG TCGCTTACCC TGACGCCGGG CACGCACCGT ATCTCGATCG TCGCCTTCCA GAGCGCCAGC GTAGGTCTTA CCGCCTCCGT GACCGTCAAC GTGCAATGA
|
Protein sequence | MNPRHTFLPG AAIVATLCAS LATFAATPSS PERFNRYGDS PMFFEPNVGQ STDVVRFIAH GSRYGLFLTN EGATLDLSRD ARHSVAFKMT FSGAAQPGSL SAEQPLPSQS NYFVGDSKNW HTGVANFARV RYSSVYPGID AVFYGNQRQL EYDFDVAAGA DPSQIGLHIA GADKLSLTAD GALEIHADGR SIVFHAPIAY QQVNGTRHSV ASHFVLRGKD EIGFAMDPYD KTRSLVIDPT LVYSSYLGGS GDDEGDAVTV DGYGYTYVTG GTVSLDFPHT AGAYLKGSAY RIFVTKIKPD GTGLVYSATI GGSKGLYGNA GLQTGRAIAV DDLGAAYVVG DVDSTNFPIT SDAIEPANAT TAGLVGVAFA LSPNASRLLY STYVGGNTYA SRVNGVALDR SRNAYITGFT SSSGLPVTPG AFQTVAKSGE EAFAAKINPS GSAYVYATYL SGNDTDQGTA IAVDGNGSAY VTGWTGCATF PSTPNAFQPT CQGPYDVFVV KLNASGSALS YGTFLHAPID FNLGPTGIAL DVHRNAYVVG RANAGLPTTP GAFQASAPGN GDGFVVKLNA TGSTELYATY LGGSTGDDFA TGVKVDLSGR AYLSGHVIGT TDFPVTANAY QSTLHTFGGP VASTRNAFLT RLNANGTGLD YSTYFGTRYA LALSLALDLK NNVVLTGLTS YNDIPITANA FDKVASNNGA LEAWVAKFSF GTTGTCTPAQ SGALICSPVE GDTVGTTVPV TAGATAEPGL YIKSIRFYVD NVAKATVSTS GNPTSFETSK SLTLTPGTHR ISIVAFQSAS VGLTASVTVN VQ
|
| |