Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3554 |
Symbol | |
ID | 5901009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3838580 |
End bp | 3839851 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564062 |
Product | major facilitator transporter |
Protein accession | YP_001685179 |
Protein GI | 167647516 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGG TGGAGGTCGC GGCGATCGAC CAGGCGAGGC CGACCCAGCC CGTGACGCCG CGCTTTATCG CCGCCTTCAC CGCGGCCCAG ATCGGCGCCT TCGTCAGCTT CATGCCCCTG CTTCAGGTGT TGCTGCCGCT CAAGGCCGAG TCGATCGATT CGGCCAACAA GGCGGTGGTG CTCAGCCAAG TGGCCATCTA TGGCGCCTTG GTGGCCAGCG TCGCCAACCT GCTGGCCGGC GCGATCAGCG ACCGGACGAC GTCGCGCTTC GGTCGACGCC GGCCCTGGAT GGTCGTCGGG ACCCTGGGGA CCGTGGCCTC GTACCTGATG ATCATGGCCG CCCATACGAC GCTGCAGCTG ATGGCCGGGG TGGTCTGTTT CCAGCTGGCC TTCAACATGC TGTTCGCCGC CCTGCTGGCG GTGTTGCCGG ACCGCGTGCC CGACGCCCAG AAGGGCAGGG TGGCGGCCTT CCTCAGCCTG GGTCATCCGA TCGGGGCCAT GGCTGGCGCC GTGCTGGTGG GCGGCATGCT GGTCAGCGAG GGGGCGCGCT ACCTGGCCAT CGCCCTGGTG CTGCTGATCG CCATCGCGCC GTTCGCCCTG GGCCTGGACG ACAAGCCCCT GCCGGTCGAG GACCGGCGGC CGTTCGGCTG GCGCGCGTTC CTGGGCGGGC TGTGGGTCAA TCCCCTGGCC CATCCCGACT TCGGCCTGGC CTGGATCAGC CGGTTCATGG TGCTGGTCGC GATCACCCTG ACCCAGAGCT ACATGCTCTA CTACCTGCAG GACGCGCTGC ACTATTCGCG GCTGTTCCCG GGCCAGCGGG CCGAGCAGGG CCTGGCCCTG CTGACCACCG TGGCCACCGG CGCCAACATC ACCTGCGCGA TGATCGGCGG CATGCTGTCC GACCGGCTGC GGCGGCGCAA GTTGTTCGCG GCCGGCGCGG CCCTGACCCT GGCGGGGGCC ATGCTGGTCT TCTCGATGAC GCCCGCCTGG CCGGTGGTGG TGGTGGGCTT CCTGATCTTC GGCTGTGGGG CGGGCTGCTA CTACGCCGTC GACATCGCCC TGGTCAGCCA GGTGCTGCCT TCGCAGAAGA ACGCCGGCAA GGATCTGGGG GTGATCAACC TGGCCAACAC CTTGCCCCAG GCCCTGGCGC CGATCCTGGC CCTGCTGTGC CTGGGCCCGC TGCACGTCAA CTATCACGCG CTCTTCGTGG TGGCGGCGGG CCTGGCGACG GCTGGCGGAC TGGCGATCCT TCCGATACGG GGCGTGCGTT AG
|
Protein sequence | MTAVEVAAID QARPTQPVTP RFIAAFTAAQ IGAFVSFMPL LQVLLPLKAE SIDSANKAVV LSQVAIYGAL VASVANLLAG AISDRTTSRF GRRRPWMVVG TLGTVASYLM IMAAHTTLQL MAGVVCFQLA FNMLFAALLA VLPDRVPDAQ KGRVAAFLSL GHPIGAMAGA VLVGGMLVSE GARYLAIALV LLIAIAPFAL GLDDKPLPVE DRRPFGWRAF LGGLWVNPLA HPDFGLAWIS RFMVLVAITL TQSYMLYYLQ DALHYSRLFP GQRAEQGLAL LTTVATGANI TCAMIGGMLS DRLRRRKLFA AGAALTLAGA MLVFSMTPAW PVVVVGFLIF GCGAGCYYAV DIALVSQVLP SQKNAGKDLG VINLANTLPQ ALAPILALLC LGPLHVNYHA LFVVAAGLAT AGGLAILPIR GVR
|
| |