Gene Information |
Locus tag | Acid345_2122 |
Symbol | |
ID | 4072364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2537002 |
End bp | 2538276 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984137 |
Product | major facilitator transporter |
Protein accession | YP_591197 |
Protein GI | 94969149 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.74746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGGTC CCACGTCCAT CTCCATGCGG TCGTACTGGC GACTGTTGCG CAACAACCGC AACTTCCGTC GTCTCTGGAT CGCGCAAGTC GTGAGCGAGA CCGGCGACTG GTTCTACATG GTCGCGCTTT ACGCGATGCT GCTGGAATTC ACTGGGCGCG CGGAGGTCCT CGGCATCGCG TTCGTACTGC AGGTGCTTCC GCAGGCGCTC ACCGGACCCA TCGCAGGCGT TATCAACGAT CACTTCAGCC GCAAACGAGT GATGGTCTTC ACCGACATCG CGCGATTCCT GATCATTAGT TGCGTTCTCT TCATTCGCTC CGCCAGCCAG GTCTGGATGA TCTATCCCCT TCTCTTTATT GAAACCGTGA TGTGGGGCTT GTTCGAACCG GCGCGTAATT CCGTGATTCC CAATGTTGTG AGCGAAGAAG ATGTGATCGT CGCCAACACC GTCAGCTCGA CGACCTGGTC TGTAAACCTG TTTCTCGGCG CCGCGCTCGG TGGCATCGCG GCCGTATGGC TAGGCCGGGA CCTTACCATC ACTCTCGATG TAATGACGTT CCTCGTCTCA GCATGGTTAA TCGCCGGGAT GAAATTCCAG GAACCGCACC TTCAAGGTCT TGAACACATC CGACTACGCG ATGCGATCAA CTTTGCGCCC ATGTTGGACG GCTTCCGCTA CATCGCTCGC CAGCCACGCA TGCTGACTAC CGTTCTTGTG AAAGCCGGCA TGGGTCTGAG CGGAGCAAGT TGGGTGCTCT TCCCGATTCT CGGCAGGCAG GTATTTCCGA TCTTCCGTGC CGGATTCACG ACTGAGAAAG CAGCCCTGGC CGGGATCAGC GCGCTTATGG CCGCCCGCGG ACTCGGTTCT GCCCTCGGGC CGGCACTTGG CGCACCATGG GCTCAACAGA ATTTCCGACG CCTGCGCTAC GGCATCTTCC TCGGGTTCCT TGCTTCAGCT GCGGGTTACT GGGCCTTGGC CTTCACGCAT ACCGCGTGGA TTGCTTATCT CGAAATCATT GGGTCGCACG CCGGCAGCGC GGTCGTCTGG GTGTTCTCCA CCACGCTTCT CCAACTGATG AGCGAAGACA AGTTCCGGGG CCGTCTTTTC TCCGCCGAAC TTGCATGCTG CACCATCACG CTCGCGGCCA CGTCCTTTGC CGCCGGGTAC GCTCTCGATC GCGGAGTCGC TCTGAATACA GTTCTCTTTT GCACGGGCCT GATCATCGCC GTCCCGTGGC TGCTGTGGGG AGCAGTCGGA TTAAAGAAAG ATTAA
Protein sequence | MPGPTSISMR SYWRLLRNNR NFRRLWIAQV VSETGDWFYM VALYAMLLEF TGRAEVLGIA FVLQVLPQAL TGPIAGVIND HFSRKRVMVF TDIARFLIIS CVLFIRSASQ VWMIYPLLFI ETVMWGLFEP ARNSVIPNVV SEEDVIVANT VSSTTWSVNL FLGAALGGIA AVWLGRDLTI TLDVMTFLVS AWLIAGMKFQ EPHLQGLEHI RLRDAINFAP MLDGFRYIAR QPRMLTTVLV KAGMGLSGAS WVLFPILGRQ VFPIFRAGFT TEKAALAGIS ALMAARGLGS ALGPALGAPW AQQNFRRLRY GIFLGFLASA AGYWALAFTH TAWIAYLEII GSHAGSAVVW VFSTTLLQLM SEDKFRGRLF SAELACCTIT LAATSFAAGY ALDRGVALNT VLFCTGLIIA VPWLLWGAVG LKKD
| |
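The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the source database: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the gene sequence (TAA here) is a stop codon that is not counted in the protein length. The variable names and the helpers `clean_sequence` and `gc_fraction` are hypothetical, introduced only for this example.

```python
# Values copied from the record above.
start_bp = 2537002
end_bp = 2538276

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
protein_length = gene_length // 3 - 1


def clean_sequence(blocks: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(blocks.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    print("Gene length (bp):", gene_length)
    print("Protein length (aa):", protein_length)
    # Only the first two blocks of the gene sequence, as a short demo
    # of the block-joining helper.
    print(clean_sequence("ATGCCCGGTC CCACGTCCAT"))
```

Under these assumptions the computed values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields. Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field.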