Gene Information |
Locus tag | Acid345_3940 |
Symbol | |
ID | 4071323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4660708 |
End bp | 4662174 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985966 |
Product | hypothetical protein |
Protein accession | YP_593014 |
Protein GI | 94970966 |
COG category | |
COG ID | |
TIGRFAM ID | |
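
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a CDS that includes its stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4660708, 4662174)
print(length)                  # 1467
print(protein_length(length))  # 488
```

Under these assumptions the Start bp and End bp values reproduce the listed Gene Length of 1467 bp, and 1467 / 3 = 489 codons minus one stop codon gives the listed Protein Length of 488 aa.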
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.634833 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACT ACGGCATCCC GTTTTTCAGC GTTGTGATCT TCGCGGTTTC ACTGCTGGTG GGATGTGGAT CGTCTCCGCT GGCGGTGCAA CCTGCGCCGC CGCTGAGCGC GGAAAATCTC AACCTGATTT TTGTCTCTAG CGAAGATTTG GCGCACCACG CTTCCGGCGA CGTCAGCGAG GCTACGGCAA ACCTTACGAA CCAAGGGCTG CAGAGAGTTC TGCTCAATGC AGCTTTCCTG CGCAAGAACG TGCTCGCTAA CCAGAATGTG AATGGAATCT ATGCGCTGGA GCCAATGACG CACTTGCAGA CGGCAAGCCA GTATCCCGAC ATGGCTGCGC TGGAGATGGC GCAACAATTC GCAGTGTTGA ACCAAGTCAC GTTGTCCAGT GACCAGTCCG GAGGCACGCC ATTCACGGGC CAAAACTTTC CCATCAATGC ATCGTATTCT CCGAACGCAG TGCCGCCGGA TGTGCTGGCT CCGCTGCAAT TCTGCCCGGC TTGCCAAGGG TTGGATTTTT CCGATGTGGG TGGTGACAAC GAAGCGGTCG TAAGCAAGGT CTTAACCGCG AAGACCCCGG GTTTCTACGT GTTCGTCGCG CCGTGGGAGA CGGTCCGCGA GTTAATGGTG AATGCCGATC GGACGGAAGG ATATGCCTTG CAACTTCCGG AGGAATATCC CGGGCCGAAC ACGATCTATG CCATAGCGGT TGCGCCTTCC GGCAGCGCAA GCCTCGTTGA CTATGACACC AAGGCAAATC CGGGCGCGTC TTATCCAACG CTGCCCGCGC CAGTTCCAAC TACCACCTGC ACGGTTCGGA CTCCCGAGAG TGTGACGGTA ACGGGTGGCG TAGATGGCGC GGTGGTTCCG GCGAATGCCA ATACCGACGA AGTGCTGTAC ATGATCCGGC ATGCGGAGGC GCACCCGCAG GGATATTGGT CGGACAACAA CTATGTCGCA GCCGGCAACT GGCGCGCATT GGCCCTTCCT TCCGCGTTGG AAGGCAAGAG CAATCCCGAC GAGGTGTGGT CAGGAGACCC GTCGTCGTTC GGAATGGGAA CGATGAGCAA TACGGGCCAA AACTATTTTT CAGGCGTAGC GCCACCCTTG ACCGTAGTGC CTTACGTGAT CGCTAAGGAC CTTCCCTATC ACCTGGTGGC TGGCTTCGAC ATGACGGCGG CGGCCTCGGC GTCGCAGAGC AGCCAGTTCT TCTTCACCGG AGGCAGGTTC TCGCAGCGCA AAGTACTTCT CGGGTGGATG TACGTTCAGA ACCAGCAAAT CATCAACGCT TTGTTCGCAA GCTATTATCC GAACGGGGGA GCGCCGGTGG TGCCGACGTG GTCTCCGTTG GATTACGACA GCCTCTGGAC GGTGACTTTC GATGGGCAGG GCAACTTCAC TGTCGATTAC TCCCGATGCG AAGGAATAGA TTCGGCGGCG CTGCCAGCGA CTGCGCCTCA GTTCTGA
|
Protein sequence | MKNYGIPFFS VVIFAVSLLV GCGSSPLAVQ PAPPLSAENL NLIFVSSEDL AHHASGDVSE ATANLTNQGL QRVLLNAAFL RKNVLANQNV NGIYALEPMT HLQTASQYPD MAALEMAQQF AVLNQVTLSS DQSGGTPFTG QNFPINASYS PNAVPPDVLA PLQFCPACQG LDFSDVGGDN EAVVSKVLTA KTPGFYVFVA PWETVRELMV NADRTEGYAL QLPEEYPGPN TIYAIAVAPS GSASLVDYDT KANPGASYPT LPAPVPTTTC TVRTPESVTV TGGVDGAVVP ANANTDEVLY MIRHAEAHPQ GYWSDNNYVA AGNWRALALP SALEGKSNPD EVWSGDPSSF GMGTMSNTGQ NYFSGVAPPL TVVPYVIAKD LPYHLVAGFD MTAAASASQS SQFFFTGGRF SQRKVLLGWM YVQNQQIINA LFASYYPNGG APVVPTWSPL DYDSLWTVTF DGQGNFTVDY SRCEGIDSAA LPATAPQF
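
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the helpers below strip that spacing, compute GC content, and translate a gene sequence into protein. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for elongation; table 11 differs in permitting alternative start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAGAACT ACGGCATCCC GTTTTTCAGC"
print(translate(prefix))  # MKNYGIPFFS
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein sequence and GC content fields.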