Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5135 |
Symbol | |
ID | 5897345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | + |
Start bp | 54795 |
End bp | 55994 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641555238 |
Product | major facilitator transporter |
Protein accession | YP_001676569 |
Protein GI | 167621784 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.774559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACCC GTCCGTCCCC GGCGCGAGAC GACGCCGCCC CCCGTCTGGC CAAACCGGTC GAACGCTGGG CCGCCGTCTT CTCGGTCGCC GTCGCCTCCT TCGCGCTGGT GACCACCGAG TTTCTGCCGG TGGGCCTGCT CAGCGCGATC GCCCAGGATC TGGGCGTCAG CGTCGGGGCG GCCGGGTTGA TGATCAGCCT GCCTGGCGCG ACGGCCGCCC TGGCCGCACC GATCCTGACG GTGCTTAGCC GCACCCTGGA TCGGCGTGTC CTGCTGCTGG CGATGACCGC CTGCTTGATT GGTGCGGACG TCGTCTCCGC CCTCACCCCC AACTTTGCCT TGATGCTGGC GGCTCGGGCG GTGCTGGGGG TGGCGATCGG CGGCTTTTGG GCGGTCGGCG CGGCGGTGGG CGGGCGCCTG GTGGGCGAGA CGCAAGCGGG CCGCGCCACC GCGATTATCT TTTCGGGGAT CTCTCTGGGC GCCCTGCTCG GCGTGCCGGT GGGCGTCTTC CTGGGGGCGC TTTCCGGTTG GCGCACGGCC TTTTGGGCGG CCGGAGGGCT CTCTCTGGTC ATCCTGATCG CCCAGGCGGT CTGGCTGCCC AAGCTGCCGG GCCTGCGCGC GGTGCAGGTC AAGGACCTGT TCGGGATCTT CGCCAATCGC AACGCCCGCG TGGGCCTGCT GGCAGTGTTT TTCGCGGTGG CCGGCCAGTT TGCGGCCTAT ACCTTCGTTA ATCCCGTTCT CCTGGACGTT ACGCGCCTGA CGCCCACGGC GCTGAGTCAG GTGTTCTTCG CCTACGGGGT GGCGGGCTTT TTCGGCAACT TCCTGGGCGG GCATGGCGCG GGCAAGAACG TTCGCGCCGC CAAGTTCGTG GTGTTGCTGG CCTTGGGCGG CGTCATCATC GCCTTTGCCC AGTTGGCGGC CCACCCGCCG GCGGCGATCT TGCTGCTGAC GGCCTGGGGA CTGGTCTGGG GCGGCCTTCC GATCTTGCAC CAGGCCTGGG CTATGCGCGC GTCGCCGGGC ATGGCCGAAG GCGGATCGGC CCTCTTCGTT TCAGTGTTCC AAGGCTCGAT CGCCATCGGC TCGGGCTTGG GCGGCGCGGC CGTGGAGACG GTGGGCCTGG TGTGGGGGTT GACCCTGGGC GGGCTGTCGA TCCTGATCGC GCTCGTGGTT TCGGTGCTCG GGTCCAAGCC TTTCCGCTAG
|
Protein sequence | MMTRPSPARD DAAPRLAKPV ERWAAVFSVA VASFALVTTE FLPVGLLSAI AQDLGVSVGA AGLMISLPGA TAALAAPILT VLSRTLDRRV LLLAMTACLI GADVVSALTP NFALMLAARA VLGVAIGGFW AVGAAVGGRL VGETQAGRAT AIIFSGISLG ALLGVPVGVF LGALSGWRTA FWAAGGLSLV ILIAQAVWLP KLPGLRAVQV KDLFGIFANR NARVGLLAVF FAVAGQFAAY TFVNPVLLDV TRLTPTALSQ VFFAYGVAGF FGNFLGGHGA GKNVRAAKFV VLLALGGVII AFAQLAAHPP AAILLLTAWG LVWGGLPILH QAWAMRASPG MAEGGSALFV SVFQGSIAIG SGLGGAAVET VGLVWGLTLG GLSILIALVV SVLGSKPFR
|
| |