Gene Information |
Locus tag | Acid345_0540 |
Symbol | |
ID | 4069998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 667547 |
End bp | 669514 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982545 |
Product | hypothetical protein |
Protein accession | YP_589619 |
Protein GI | 94967571 |
COG category | |
COG ID | |
TIGRFAM ID | |
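The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 667547
end_bp = 669514
gene_length_bp = 1968
protein_length_aa = 655

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)             # gene length is consistent with the coordinates
print(computed_length % 3 == 0)                      # CDS is a whole number of codons
print(computed_protein_length == protein_length_aa)  # protein length is consistent with the CDS
```

Under these assumptions, all three checks agree with the values listed in the table.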
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.920015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.216808 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGT TCTGGTTTGC CCGCAACACT ACCTTGTGTG CTGTGGTTTT CCTCGGTTTA GGTCTGGCCC GTGCGATTGA TTTTCCGCCT GTCACTCCTG AACAACTTGC GATGAAAGAC AATCCGGCGC AGCCCGGGGC GAAGGCGATG ATTCTGTATC GAGCCATTGA GCGCGACGAC AGGATGGGCT CGCAGGCTGA GTATGCCCAA ATCAAGATCT TCACGCAGGA AGGCAAAGAC TACGGCGATG TTGTACTGGA TTTCGATCGG GGTGCATACA CGGTCGAATC GATCAAGGGC CGGACAATTC ATCCGGACGG AACCGTGATT CCGTTCAATG GCAAGGCGTA TGAAAAGGTG ATTGCCGAAG GCCAGGGATT CAAGGTTCAC CGCAAGGCAT TCACCATGCC CGACGTGACG CCCGGCAGCG TCCTTGAATA CAAGTACGTA GTGCGATGGG AGGCCTCGGA CCCGGCGACA CACCAATATT ATTATTTCCC TCGCTCCGAG TGGGAGGTAT CCAAGGAGCT CTACCAGCAG AGCGCGCATT TTGTGTTTAA GCCGCTGACG ATGGACGGAC TCTATTGGTC CCTGCGCAGC AACCGCCTTC CGCCGGATGC GAAGTTCAAT CACGAACAAC TTACGGACAA GGTCACCCTG GACCTGGTGA ATGTACCTGG CGTGGAGAAG GAAGAATTCA TGCCGCCGTC GTCAGAGACG AAGGCGCGAG TGCTCTTCTT CTATAGCGAT ACCCATATTC CGGAGCCCGA TCAGTATTGG AAAGACCATG GAAAGAAGTG GCACGGCTGG GCGGAAGGCT TCATGGACAA AAAGGGCGCG ATCCAGAAGG ACCTCGCCAG CGTGATTTCG TCGAGCGATT CGACGGATGT GAAGCTCCGG AAGATTTATG AGCACGTGCA GTCATTCGAG AACCTGGAAT TCGAATCGGC GAAGAGCGAC AAGGAAATCA AGGCGCTGAA GATCCGGGAC ATTAAGAGCA TTGAAGACGT GATCAATGGC AAGGCCGGGT ATCGCAATGA GCTGAACCGT ACATTCGTCG CGCTGGCGCG TGGAGCAGGA TTCGATGCGA CGCTGGTGGC GGTGACGGAG CGCGATACGG CCATCTTCCA CAAAGAGTGG CCATCTTCGT CGCAGCTCGC TTATGAGATC CCGCTGGTGA AGGTGAACGG TGCAGATATT TACCTCGATC CGGGGAGCCC GTTTTGCCCG TTCGGCGTGG TGCCATGGGA AGACACCGCA GTTTCAGGAC TGAAGCTCGA TAAGAACCCG CCGGTTTGGG CACAGATTCC ACTTCCACCC AGCGATGACT CGAGCATTAA GCGCGTCGCG AAGATGACGT TAGGCGACGA CGGTTCTCTG ACCGGCGAGG TCGAAGTGAC GTTCACGGGG CAGGATGCGT TCCATCATCG GCTCTGGGAG CGCAACGAGG ACGATGCCGG CAAGAAGAAA GATATGGAGG AATTACTCCA GGACTGGATG GCGCTGAAGG CGGATATTGA GTTGGAGAAG GTAAATGATT GGAAGGCGTC CAATGTTCCG CTCGTCGCGA CCTTCAAGGT GACGGTGGCA GGCTACGCGA GCCAGGCCGG CAAACGCGTG CTGATTCCCT GCACTTTGTT CGCCGCTGCC TACAGGAACC CGTTCACTCC GACGAAGCGG GTGAACCCAA TCATCATGCA TTACGCCTAC GACCGCAGTG ACGACGTCAC GATCAAGCTG CCGGCGAATT TCCAGGTGGA GAGCATGCCG AAGCCGGTCG CCGAACAGAA CAACATCGCG 
GACTTGAACG TGAAGTGCGA CAGCAACAAC GGAACGTTGC ACCTGGTGCG GGACTTCAAG CTGAAGGGCC TATTCATTGA TCAGAAGTAT TACGGAGCGG TTCGCGGATA TTTCCAGCAG GTCCAGGCGG GGGCAAATGA GCAAGCTGTA CTCAAGATGG GAAATTAG
|
Protein sequence | MKKFWFARNT TLCAVVFLGL GLARAIDFPP VTPEQLAMKD NPAQPGAKAM ILYRAIERDD RMGSQAEYAQ IKIFTQEGKD YGDVVLDFDR GAYTVESIKG RTIHPDGTVI PFNGKAYEKV IAEGQGFKVH RKAFTMPDVT PGSVLEYKYV VRWEASDPAT HQYYYFPRSE WEVSKELYQQ SAHFVFKPLT MDGLYWSLRS NRLPPDAKFN HEQLTDKVTL DLVNVPGVEK EEFMPPSSET KARVLFFYSD THIPEPDQYW KDHGKKWHGW AEGFMDKKGA IQKDLASVIS SSDSTDVKLR KIYEHVQSFE NLEFESAKSD KEIKALKIRD IKSIEDVING KAGYRNELNR TFVALARGAG FDATLVAVTE RDTAIFHKEW PSSSQLAYEI PLVKVNGADI YLDPGSPFCP FGVVPWEDTA VSGLKLDKNP PVWAQIPLPP SDDSSIKRVA KMTLGDDGSL TGEVEVTFTG QDAFHHRLWE RNEDDAGKKK DMEELLQDWM ALKADIELEK VNDWKASNVP LVATFKVTVA GYASQAGKRV LIPCTLFAAA YRNPFTPTKR VNPIIMHYAY DRSDDVTIKL PANFQVESMP KPVAEQNNIA DLNVKCDSNN GTLHLVRDFK LKGLFIDQKY YGAVRGYFQQ VQAGANEQAV LKMGN
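The relationship between the gene sequence and the protein sequence can be illustrated with a short, dependency-free sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG even though the gene is on the minus strand) and that the 10-base blocks are separated only by whitespace. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, not part of any database tool. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence from the record.
first_blocks = "ATGAAGAAGT TCTGGTTTGC CCGCAACACT"
last_blocks = "CTCAAGATGG GAAATTAG"

print(translate(first_blocks))  # MKKFWFARNT
print(translate(last_blocks))   # LKMGN
```

The two printed fragments match the first and last residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give the value to compare against the reported GC content of 56%.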
|
| |