Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4447 |
Symbol | |
ID | 4070930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5279323 |
End bp | 5280687 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637986486 |
Product | hypothetical protein |
Protein accession | YP_593521 |
Protein GI | 94971473 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAACC GCCGATGTTC GGGGGCCATG ATTCTATTTG TCAGCCTGTT GCTGCCCACA ATCGCTCGAG GGCAGGAGCA GCCGAGTACC TCTCAACCAT CGGCTGAGGC GCTCGCTAAA CATCTTGAGG AGATGGAACG TGAGATCAAG GAATTGCGGG CCCAGGTGAA AGTGCTAACC GGTGAAAAAG CGGCGGCAGA GACCGCGCCA GCCGCGTCAG GGGCGTCGTC TGCTGTTGTG CAGAACTCGC TGGTAAGCGG CACTCAACCT GCCGCCGCTC CTTCGGCAAC CCCATCGCTG GCTTCGATTC TCGGGCCAAC CACGCTGAGC GGATTCGTGG ACGTGTATTA CGGCCAGAAC TTCAACAATC CCGAAAGCCA GAACAACGGC TTGCGCTATT TCGATCAGGG CGCAAACCAA TTCGGTTTGA ACTTGATGGA GTTGGTGATC GACAAGACGC CGGATCCCTC GAACAGCCGT ACCGGCTACC ACGTTGCCCT CGGCTATGGC CAGGCGATGA ACGCGGTCAA TGCCTCCGAA CCCAAAGCCG GGCTGGGCTT CGATCAGTAC CTGAAGGAAG CCTACTTCTC GTATCTCGCG CCTGTCGGAA AAGGGCTGCA ATTTGACGTC GGCAAGTTCG TCACGCCCGC CGGCGCCGAA GTGATCGAAA CCAAGGACAA CTGGAACTAC TCGCGTGGCG TGCTCTTCTC GTATGCCATC CCGTATTTCC ACTTCGGCAT GCGCACCAAG TACACCTTCA ACGACAAATA TGCGCTGACC GGTTTCTTCA TCAACGGTTG GAACAACGTT GTGGACAACA ACACCGGCAA GACCTACGGG GTCAACTTCG CATGGAACCC CAACAAGAAG TTTGGAATCG CCCAAACCTA CATGGCGGGT CCGGAAGAGA ACGGCCTCAA CCACAACGTG CGCCAGTTGA GTGACACGGT CTTCACCTAC ACGCCGACAG CGAGACTTTC GTTCATGTTG AACGGCGACT ACGGTCGTGG CGATCGCTAC GTCACCGACA CCGAAGCGAA CACCTTTTCG CATGCGGTGC ACTGGACGGG CGTAGCAGGC TACGCAAAGT ACGCATTGGC CCAGAACATG GCCATCGCCG GCCGATATGA GTACTACGAC GACGCCGACG GCTACACGCT CGGAACCCTG ACAACGACCC ACGTCAACGA ATTCACTGCC ACCTTCGAAC GGATCATCGG ACACCACATC ATCAGCCGCT TCGAGTTCCG TCGAGATATG TCGAACCAGC CGCTGTTCTA TAAGGGCAGC AATCCGGTCA CTGACCAGAA CACGCTGACC GCGGGCTTGG TTATGACCTT CAACAGCGGG GAGGGCGGCA AGTGA
|
Protein sequence | MRNRRCSGAM ILFVSLLLPT IARGQEQPST SQPSAEALAK HLEEMEREIK ELRAQVKVLT GEKAAAETAP AASGASSAVV QNSLVSGTQP AAAPSATPSL ASILGPTTLS GFVDVYYGQN FNNPESQNNG LRYFDQGANQ FGLNLMELVI DKTPDPSNSR TGYHVALGYG QAMNAVNASE PKAGLGFDQY LKEAYFSYLA PVGKGLQFDV GKFVTPAGAE VIETKDNWNY SRGVLFSYAI PYFHFGMRTK YTFNDKYALT GFFINGWNNV VDNNTGKTYG VNFAWNPNKK FGIAQTYMAG PEENGLNHNV RQLSDTVFTY TPTARLSFML NGDYGRGDRY VTDTEANTFS HAVHWTGVAG YAKYALAQNM AIAGRYEYYD DADGYTLGTL TTTHVNEFTA TFERIIGHHI ISRFEFRRDM SNQPLFYKGS NPVTDQNTLT AGLVMTFNSG EGGK
|
| |