Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3822 |
Symbol | |
ID | 4071106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4518100 |
End bp | 4519287 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637985845 |
Product | hypothetical protein |
Protein accession | YP_592896 |
Protein GI | 94970848 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.333872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.578419 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCG ATTTCGAAAA CCAATTCCTC GCAAAGGTAG TACCTGGCCC AGCCGAACGT CTTGTGTTGC TTGCGTGCTC AAGCGATTCG GGGACCAACA CTGAGATCAA AGCGCTTTAC GAACAGGTCG GAGACGAGGT CGCGTGGCAA GTCGCAAAAC AGCACGAGCT TGAAGGAAAT CTCGGGCACC GGTTGATCGA CATTTTGGGT GAGCAAGTTC CACTCCGTTG GCGCGCAGCG CATGAGACGG TGGGGAACAG GATTGGTGCG TATCTGAACG AAGTGGACCG AATGGCGGCT CGGTTAGCAC AACAGGACAT TCCTCTGGTT GCGCTAAAGA ATGCTGGTAT CGCCCGCGGT GTGTATCAGT GCGCGGGGTG CTCGCCAATG GGCGATGTCG ATCTCCTGGT TCGGCGCGCC GACTATCGGC GTGTCCACGC AATTCTGTTA GAAGAAGGGT TTACGTGTGA CTCGCGGAAT GTCACTGAGG AAGGGACCTT GGAAGAGGGC GAGGTAACGG GCGGAACCGA ATATCACAAG GAGATTCCGG ACGTTGGTAC GTTCTGGTTA GAACTGCAGT GGCGGCCAGT TTCGGGTCGA TGGTTACGTC CGGATCAGGA GCCAAATGGC GACGAACTCG TTAGTCGGTC TGTTCCGATT GAAGGCACCC ATTTGCGGCT GTTGAACCCT GAGGACAACC TCTTACAGGT GTGTCTCCAT ACGGCGAAGC ACACATACCT GAGGGCACCA GGGTTGCGCT TGCACACCGA CGTAGAGCGC ATCGTGAGGC AGTTGCAAAT CGATTGGGAG GCCTTCCTCG CAAAGGCGAA GGCCCTTCAG GTGCGGACTT CGACTTACTT TTCACTTTGG CTGCCGGCGC GGCTTCTCAA CACTCCAGTG CCTGACGCTG TGTTGTCGGA ACTCGCGCCC TCTCGGCGGA AGCGCAAGGC GATACTCAAG CGTTTGCAAA GAGCGGGACT GTTTTATCCG GCCCGACCGA AGTTTTCCAA CATCGCGTAC ATTCGGTTCA ATAGCCTTTT GTACGACAGT TCGAACGGAT TGGTCCGGGC GATTTTCCCC GACACAGAGT GGATGAAGAA GCGGTATGGT TTCCGGAGCG CCCTTTTACT CCCCTATTAC CACGTGCGAC GCATCGCGGA TTTGGGGCTG CGTCGAGTTG GGATTTGA
|
Protein sequence | MTSDFENQFL AKVVPGPAER LVLLACSSDS GTNTEIKALY EQVGDEVAWQ VAKQHELEGN LGHRLIDILG EQVPLRWRAA HETVGNRIGA YLNEVDRMAA RLAQQDIPLV ALKNAGIARG VYQCAGCSPM GDVDLLVRRA DYRRVHAILL EEGFTCDSRN VTEEGTLEEG EVTGGTEYHK EIPDVGTFWL ELQWRPVSGR WLRPDQEPNG DELVSRSVPI EGTHLRLLNP EDNLLQVCLH TAKHTYLRAP GLRLHTDVER IVRQLQIDWE AFLAKAKALQ VRTSTYFSLW LPARLLNTPV PDAVLSELAP SRRKRKAILK RLQRAGLFYP ARPKFSNIAY IRFNSLLYDS SNGLVRAIFP DTEWMKKRYG FRSALLLPYY HVRRIADLGL RRVGI
|
| |