Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0780 |
Symbol | |
ID | 4069525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 963353 |
End bp | 965593 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637982786 |
Product | hypothetical protein |
Protein accession | YP_589859 |
Protein GI | 94967811 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0567586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.143797 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTTGC GCCTCGATTT CGTAGATGAT TTCCTCGTGT CATCCATGCC ATCCAATGTG GCTTCGCAGT CCGCAAGTCC GCCGCGCGGC GAATCTGTGC GCTATTGGAT CGCCGTTGGC CTCGCGCTTT CCTACGCATT CTTCGCCGGA CTGAAGACCG CGGCCGATCC CGATCTCGGC TGGCAACTTG CCGCCGGTCG CTGGATGCTT GAGCACCACC AGATCCTGCG CACCGACGTC TTCACCTACA CCGGCTTCGG ACGCGAATGG ATTTACCCGG CCCTTTCGCA AATCTTCGAA TACCTCCTGT ATCGCATCGG CAGCTACTCG CTGCTCTCGT GGACAGCCGC TGTTGGTTGC GTCGCGACCG TAGCGCTTCT CCTGCATGGA GCACGCTTTG CCACCGCAAT CATCGCGATT CTCGCCATAC CTCTACTTGC AGCGCGTTGC GTCATGCGCG CTGAACTCTT CTCGGTCATT CTCTTCGCTG CGTTCGTCTC CATCCTTTGG AACTTTCATC GCTCCCGTCG CGGACTGCTG TGGATCCTGC CGCTCTTGAT GGCGCTTTGG GTGAACCTGC ACCTCGGCTT TTTGGCGGGC TTCGGGATGT GCGCAGCCTA CGTGTTGCTC GAGATTGGCG AGCTCTTCAC GCTGCAGAAA CGCAGTGATG CCCTTTCTCG TCTCCGCTCC GCCGCGCCAT GGCTGCTCGC AACCATACCC GCGACCCTGC TCAATCCCTG GGGATGGCGC GTCTACGCTG GCATGTTCCG CCTCATGCCG ACGGGCACCA ACCCCTTCAT CCTCGAACTC ATGCGCGTCC GCGTGAACTC GACCACAGCA ATGCAAGCCT TCGCGTGGCG CGACTACGAG AGCGCGTTCT TCTGGTTCCT CGCCATCGCC GCCATCTGCA CCGTTGCAGC TCTCATGCAA CGTCGTTTCG CCGAAGCCGT CATCCTCGTA GGATCCGTGT ACGCTGCGGT TCACGCATCG CGGTTCGTAG CGATGTTCGC CATCATCGTC GTCGTAATCG GCGGGTCCGT TTTCACCGAT TGGGTGCCCC ATGTCTCGCG ATTTTCGAGA CGTGGGGATT TCCCCGACAT CTCCACGAAA GCAGCCACCA TCGTCCTCGT CCTCTCGGCG TTTCTAGTCG CGGTCCGAAT CTCCGATCTC GTCACCAACC GCTTCTACCT GCGCACGCCC GGCCAATACT CCGTCTTCGG CGCAGGCGCT CCCATACGCT TCCCCGCTGG CGCCGCGGAT TTCATCGTTC GTAACCACCT CCCCGCGAAC GTCTTCAACG ACTACAACTC TGGCGGCTTC CTCATGGGCA AGCTCGCTCC CGAATACCGT CTCTACCTCG ACGGCCGCGG CGAACTCGAA CCCGGCCTCT ACGTCCACGC GCAGCAACTG CTGACACAGT CGCTCGACTC GCAAGACTGG CAGCGCGAAG TCGCATCGCG CCACATCAAC ACGGTCGTCG TCTCCCTCGA CCGCGAATAC GGCATGGGCC TCGCGAGCCT CAACAAGTTC TGCAACAGCC CCGGATGGAA GCCCGCTTAT CTCGATCCAT TTGGCGCTGT CTTCGTAAAT ATGGGTGCCC CACCGTCGGG GTCCCCGGAG AGCGCCGCCT TTGCGCTTTC TGGGGTGGCA GGCTTCCCGA AGGGTGAGTG GGAAGAGACG CAACTCGACT GCTCCCAAGT TCGCTTCGAC GCTCCTCCAA CTGGCGACAG CTTCCGCGCT CGCGCCGATC GCTTCAACTA CCTCCTCAAC AGCGCCGCAA TCCTCATCGT TCTCGATCGC ACCGCCGAAG CGCTCTCTGC CTTGCAAAGC GCCGAGGCCG TCGAGTCGCA AAATGCATTC CTCCACTACG CCAAAGGCGC CGCGCTTCTC CAATCCGGTC GCTGGAACGA GTCGGAGTCT TCGCTCCACA CTGCCGTGAA TCTCGGTTCC GACGAAGCCG CTTCCGCGCT AGCCCGCGCC TACGACCAGC AAGGCCGCTA CCCCGACGAA GTCGCAGTCC TGCATCTCGC CGCCTCCCGC GCACCGCAAC CAAGTTGGTT CTACCTCAAG CTCGGCCTCG CCGAACTCGC GCAAAACCAC GCCCGCGAAG CCCTCGACGG ATTCCACAAT GCCGAGCGCG AAGACCCGTT TAACGGCGGG GACGACGCCG GCACCGGCTA CCATTCCCAA CTCGCCGAGG GCCATGCCCG CGCCGAAGCG CTGCTACAAT CGCAGCGCTA A
|
Protein sequence | MGLRLDFVDD FLVSSMPSNV ASQSASPPRG ESVRYWIAVG LALSYAFFAG LKTAADPDLG WQLAAGRWML EHHQILRTDV FTYTGFGREW IYPALSQIFE YLLYRIGSYS LLSWTAAVGC VATVALLLHG ARFATAIIAI LAIPLLAARC VMRAELFSVI LFAAFVSILW NFHRSRRGLL WILPLLMALW VNLHLGFLAG FGMCAAYVLL EIGELFTLQK RSDALSRLRS AAPWLLATIP ATLLNPWGWR VYAGMFRLMP TGTNPFILEL MRVRVNSTTA MQAFAWRDYE SAFFWFLAIA AICTVAALMQ RRFAEAVILV GSVYAAVHAS RFVAMFAIIV VVIGGSVFTD WVPHVSRFSR RGDFPDISTK AATIVLVLSA FLVAVRISDL VTNRFYLRTP GQYSVFGAGA PIRFPAGAAD FIVRNHLPAN VFNDYNSGGF LMGKLAPEYR LYLDGRGELE PGLYVHAQQL LTQSLDSQDW QREVASRHIN TVVVSLDREY GMGLASLNKF CNSPGWKPAY LDPFGAVFVN MGAPPSGSPE SAAFALSGVA GFPKGEWEET QLDCSQVRFD APPTGDSFRA RADRFNYLLN SAAILIVLDR TAEALSALQS AEAVESQNAF LHYAKGAALL QSGRWNESES SLHTAVNLGS DEAASALARA YDQQGRYPDE VAVLHLAASR APQPSWFYLK LGLAELAQNH AREALDGFHN AEREDPFNGG DDAGTGYHSQ LAEGHARAEA LLQSQR
|
| |