Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2533 |
Symbol | |
ID | 4072177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2989891 |
End bp | 2993022 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984550 |
Product | hypothetical protein |
Protein accession | YP_591608 |
Protein GI | 94969560 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.693881 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATTT GCTCGCGCTT CCTGGCGCTG GTTTTTGCGC TCCTGGTTAT CTCTTCTTCT CTTCATGCTC AGCAGTATTC CGAACAGAAT TTTGGCGCGA TGAAGTGGCG GCAGATTGGG CCGTTCCGCG GCGGACGTGT GCTGGCCGTG ACGGGCGTGC CGGGGGATCC GGCGACGTTT TACTTTGGCG CTGTCGCGGG CGGCGTGTGG AAGACCTCCG ATGCGGGCGG GACATGGAAG CCGATTGCGG ATAAGGACGG GATTGTTTCC GTCGGGGCGA TTGCGGTTTC TGAGAGCGAT CACAACGTGC TGTATGTGGG GACCGGCGAG GCTTGCATCC GAGGGAACAT TACTTACGGC AACGGCGTGT ACAAATCCGT GGATGGCGGG CAGAACTGGC AGCACATTGG ACTGGAAGAC ACGAGGCAGA TTGGGCGGCT AATTGTTGAT CCGAAGAACC CGAACCGCGC GTTTGTGGCG GCGCTGGGAC ATGCGTTTGG GCCGAATGCG GAGCGCGGTG TGTTCCGCAC GATTGATGGC GGTAAGACCT GGGAGAAAGT TCTCTACAAG GACGATCAGA CGGGCGCGAT CGATGTGCAG TTCGATCCGA ACAACGCGAA TACCGTGTAC GCGGCGCTGT GGCAGGTGGT GCGCAAGCCC TGGAACATGA GCAGTGGCGG GCCGGGGAGC GGGCTGTATA AGTCCACTGA TGGCGGCACG ACTTGGAAGC GACTGGAAGG GCATGGGCTG CCGGAGGGGA TTTACGGACG GATTGGGATT GCGGTGGCGG CGAACTCGAC GCGGGTGTTT GCGCTGATCG AGGCGAAAGA GGGCGGCATC TTCCGCTCCG AAGATTCGGG CGCGACGTGG ACGCGCATCA ACGATGATGA GCGCTATCGG CAGCGCGCGT GGTACTTCAC GCACATCTTT GCCGATCCCA AGAACATTGA CACCGTGTAT GTGCTGAATA CCGGTGCGTT CAAATCTACG GACGGCGCGA AGACGTTTGA TCTTTTGCCG GCGCCGCATG GCGATCATCA TGGGCTGTGG ATTGATCCGC AGAACAGCGA TCGGCTGATC AACAGCAATG ATGGCGGGGC GACGATCTCG CTCGATGGCG GGAAGACGTG GTCGACGCAG CAGAACCAGC CGACGGCGCA GTTCTACCAC ATTGTGGCGG ACAACCGGTA TCCGTACTAC CTGTACGGCG CGCAGCAGGA CAACTCGACC GTCGGGATTG CAACGATGGA TGAGCAGGAA GGCGTGATTG GGCGCTGGGA TTGGTACGCG GTGGGCGGCG GCGAGAGCGG CTATATTGCG CCGGACCCGA ACAACGCAAA CATTGTGTAT GCGGGCGATG GCGGAGGCGT GGTGACGCGC TACGACCGCT CGCGCGAGAC CATCCAGGAC ATTTCGCCGT TCCCACTGGA TACGTCGGGA CAGGGCGCGG ACAAGCAGAA GATCCGCTTC CAGTGGACGG AGCCGATCAT TATCTCGCCG CACGATCCGA ACACGATTTA CACGGCGGGC GATCGCATCT TCAAAACTAC CGATCGTGGA CAGAGCTGGA AGGAGATTTC GCCAGACCTG ACGCGGAATG ATAAGTCGAA GCAGACGCCA TCGGGTGGGC CAATCACGCT GGACATTACG ACTGTGGAGT ATTACGACAC GGTGTTCACG GTGGCGGAGT CGCCGAAGCA GAAGGACTTG ATCTGGGCGG GCACAGACGA CGGACTGATG AAGCTGACGC GCGATGGCGG CGCGCACTGG GAAGACATTA CGCCGAAAGC GATGCCGGAG TGGAGCACGG TGAGCCTGGT GGAGGCGTCA CCGTTTGATG CGGGGACGGC GTATATCGCG GTGGATCGGC ATAAGCTGGA TGACATTAAG CCGTACATCT ACAAGACGCA CGATTTCGGC AAGACCTGGA CGGCGATCAC TGCAGGTATT CCGGAGAATG CTTACGTGCA TGCCGTGCGT GAGGACACGG TGCGCAAGGG ACTGCTCTTC GCCGGAACGG AGAAGGGCGT TTATGTGTCG TTCAACGATG GGGCGAATTG GGAGCGGCTG CAACTGAATC TGCCGGTGGT GCCGATTCAT GACCTCGTGA TTCATGCGAA TGATTTGTCG GTGGCGACGC ATGGGCGGTC GTTCTGGGTG CTGGATGACA TCACGCCTCT GCGGGAGTTG GACGGTGGGA ATGCGGAGGC GGTGCTGTAC AGGCCGCGGG AAAGCCATCG AGTGCACTAT CCGGATGGAG TGGATCGGCG GCGACCTGTG GGGGACAATC CGCCGAACGG AGCGACCTTC TACTATTACT TGAAGGATGC GCCGAAAACT GAGGCTACGC TGGAGATTCT GGATTCGAGC GGCAAGCTGG TAAAGAAGTT CTCAGACCGC GAAAAGAAGG CCGCCAACGA GCAGCCGCAA GAGTGGCCGG ACCTGGAAGC GCCTCCGAAC TTGATTCCGG CGAAGGCTGG GTTGAATCGT TTTGCGTGGA ATTTGCGGTG GGAGGATCCG ACGCAGACGC CGAGCGCGGT GTACGAAGGC CTGCCGCCGC AAGGACCTGT GGCAGCGCCG GGGAAGTACA CGATCCGGCT CACAGTGGAC GGGCGGAAGT CTGAACAGCC GTGGGAACTG AAGGCGGATC CGCGGGATTC TGCGGACGTC GCGCAGGGGA TCGAACAGCA GGTGGCGTTC GAACTTGAAG TGCGCGAGCG CATCACGAAA CTGCATACGG CGGTGAACCA GATCCGCGAC CTGCGGGAGA AGCTGGAGAC ACTGAAGAAG TGGGTGGGCG AAAATCCGCA AGGGAAGCAG TTGCTGGAAC AGGCGGAGGC GCTCGATAAG AAGATGTCGG GCGTGGAGGA ACAGCTGATC CAGGTGAAGC TGAAGAGCAC GGAAGGGAAC CTGCGGTATC CGAACATGCT GAACGAGCAG TGGGCCACGT TTGCCGCGTT CATTGATATT GCGGATGCGC CGCCGACTAC GCAGGAGAAG TCGGTGTACG AGTATCTGTC TCAGCAATCG GATGCAAATA TCGCCAGGTG GGAAGAGATT CGGAAGACGG ATGTGCCTGC TTTGAATGAG GCGATGCAGA AGAGCGGGGC GGTTAGGCTG GGGGTGGAAT AG
|
Protein sequence | MRICSRFLAL VFALLVISSS LHAQQYSEQN FGAMKWRQIG PFRGGRVLAV TGVPGDPATF YFGAVAGGVW KTSDAGGTWK PIADKDGIVS VGAIAVSESD HNVLYVGTGE ACIRGNITYG NGVYKSVDGG QNWQHIGLED TRQIGRLIVD PKNPNRAFVA ALGHAFGPNA ERGVFRTIDG GKTWEKVLYK DDQTGAIDVQ FDPNNANTVY AALWQVVRKP WNMSSGGPGS GLYKSTDGGT TWKRLEGHGL PEGIYGRIGI AVAANSTRVF ALIEAKEGGI FRSEDSGATW TRINDDERYR QRAWYFTHIF ADPKNIDTVY VLNTGAFKST DGAKTFDLLP APHGDHHGLW IDPQNSDRLI NSNDGGATIS LDGGKTWSTQ QNQPTAQFYH IVADNRYPYY LYGAQQDNST VGIATMDEQE GVIGRWDWYA VGGGESGYIA PDPNNANIVY AGDGGGVVTR YDRSRETIQD ISPFPLDTSG QGADKQKIRF QWTEPIIISP HDPNTIYTAG DRIFKTTDRG QSWKEISPDL TRNDKSKQTP SGGPITLDIT TVEYYDTVFT VAESPKQKDL IWAGTDDGLM KLTRDGGAHW EDITPKAMPE WSTVSLVEAS PFDAGTAYIA VDRHKLDDIK PYIYKTHDFG KTWTAITAGI PENAYVHAVR EDTVRKGLLF AGTEKGVYVS FNDGANWERL QLNLPVVPIH DLVIHANDLS VATHGRSFWV LDDITPLREL DGGNAEAVLY RPRESHRVHY PDGVDRRRPV GDNPPNGATF YYYLKDAPKT EATLEILDSS GKLVKKFSDR EKKAANEQPQ EWPDLEAPPN LIPAKAGLNR FAWNLRWEDP TQTPSAVYEG LPPQGPVAAP GKYTIRLTVD GRKSEQPWEL KADPRDSADV AQGIEQQVAF ELEVRERITK LHTAVNQIRD LREKLETLKK WVGENPQGKQ LLEQAEALDK KMSGVEEQLI QVKLKSTEGN LRYPNMLNEQ WATFAAFIDI ADAPPTTQEK SVYEYLSQQS DANIARWEEI RKTDVPALNE AMQKSGAVRL GVE
|
| |