Gene Information |
Locus tag | Acid345_2559 |
Symbol | |
ID | 4072203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3023005 |
End bp | 3024228 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984576 |
Product | major facilitator transporter |
Protein accession | YP_591634 |
Protein GI | 94969586 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00891] putative sialic acid transporter |
|
|
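The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3023005
end_bp = 3024228
reported_gene_length = 1224      # bp
reported_protein_length = 407    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the reported Gene Length and Protein Length, and the gene length is a whole number of codons.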
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCG CCCCCGGAAG CGCCTCCGGC CTGACCACCT CGCAACGCAC CCATGCAGTT CTCGCCGGCT ACCTCGGCTG GACGATGGAC GCGTTCGACT TCTTCGTCGT GGTGTTCATG CTCGGCACCC TCGCAGAAGC GTTCGCGGTA AAGAAATCCG AAATCGTCTT CACCATGACG ATCACATTGG CGATGCGTCC GGTGGGCGCG TTCCTGTTCG GTTTGCTGGC GGACCGGTTC GGACGCCGCG TTCCGTTCAT GGCGAATGTC ATCTACTTCT CGCTGATCGA GGTGCTCTGC GGCTTCGCTC CGAATTACAA AGTCTTCCTG CTGCTCCGCG CGCTCTACGG TATCGGCATG GGCGGCGAGT GGGGAATTGG CGCATCGCTG GCGATGGAGA GCATCCCACA GCGTTTGCGC GGTATGGTCT CCGGTGTCTT GCAAAGCGGC TATTCCGCCG GATACTTGCT GGCGGCGCTC GCCTATCGTT TCGTTTTTCC AGGTCTCGGT TGGCGATGGA TGTTCTGGAT CGGAGGCATA CCGGCGGTAT TGGCGTTGTA CATCCGCTGG CATGTGCCCG AATCCGACGC GTGGAAGGAG CATGCCGCGA ATAAGGTGTC TGACATCATG CGGGTGTTCG CGGGCTATTG GAAATCGTTT GCATATCTGC TCGTGATGAT GACGCTGTTT ATGTTCCTCT CGCATGGCAC GCAGGACCTT TATCCTGACT TCCTGAAAAC CGAACACAAT CTCAGCGCCG CATGGGTTTC GTATATTGCG ATCATCTACA ACATTGGCGC GATTGTCGGG GCGATCATCT TCGGCCTGAT CTCGCAGCGA ATGGGGCGAC GGAAGGGAAT TGTCTTCGCC CTCTTCCTGT CGTTCCTCAC GATTCCCGCC TGGGCATTCG GCCACGGCTT GGTCGTGGTT GCGGCCGCGG CGTTCCTGAT GCAAGTCGGC GTTCAAGGCG CGTGGGGCGT TGTGCCGGTG CACCTGAACG AGCTTGCGCC TGACGCGGCT CGCGGGCTCG TGCCGGGCTT TGCCTATCAA CTCGGCATCC TGTTTGCGTC CGGTACGAAC AACATTGAGT ACGCGCTGCG CGATCATTTC GGATATCGCT GGGCGCTTGC CGGGTTTGAA ATTTTCACCA TCATCAGTCT CGCCATCGTG GTCTGGTTTG GGCGCGAGGC GCACGGCAAA CAATTCAGCA AATTGAGCAC TTGA
|
Protein sequence | MDTAPGSASG LTTSQRTHAV LAGYLGWTMD AFDFFVVVFM LGTLAEAFAV KKSEIVFTMT ITLAMRPVGA FLFGLLADRF GRRVPFMANV IYFSLIEVLC GFAPNYKVFL LLRALYGIGM GGEWGIGASL AMESIPQRLR GMVSGVLQSG YSAGYLLAAL AYRFVFPGLG WRWMFWIGGI PAVLALYIRW HVPESDAWKE HAANKVSDIM RVFAGYWKSF AYLLVMMTLF MFLSHGTQDL YPDFLKTEHN LSAAWVSYIA IIYNIGAIVG AIIFGLISQR MGRRKGIVFA LFLSFLTIPA WAFGHGLVVV AAAAFLMQVG VQGAWGVVPV HLNELAPDAA RGLVPGFAYQ LGILFASGTN NIEYALRDHF GYRWALAGFE IFTIISLAIV VWFGREAHGK QFSKLST
|
| |
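The sequences above are displayed in space-separated groups of 10 characters, so the whitespace has to be removed before any length or composition check. The following is a minimal sketch of that step together with a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo input is only the first three groups of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits the sequence into 10-base groups."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three groups of the gene sequence, used here only as a short demo.
demo = clean_sequence("ATGGACACCG CCCCCGGAAG CGCCTCCGGC")
print(len(demo), round(gc_fraction(demo), 2))
```

Applying the same two helpers to the full Gene sequence field should give a length and GC percentage that can be compared with the Gene Length and GC content fields, assuming the displayed sequence is complete.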