Gene Information |
Locus tag | Caul_1194 |
Symbol | |
ID | 5898649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1255864 |
End bp | 1257114 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561677 |
Product | major facilitator transporter |
Protein accession | YP_001682822 |
Protein GI | 167645159 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
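The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 1251 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length_bp(1255864, 1257114)
print(gene_len)                     # 1251
print(protein_length_aa(gene_len))  # 416
```

Both values agree with the Gene Length and Protein Length rows of the record.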
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACGC TTTCCAGTCC CACGACTCCG AACGCCTCGA CGACGCCGGA CGCCAGTCCG CGCGCGCTCT ACGTCCTGCT GCTGGTGGTG TTCATCAACC TGGTGGGCTT CGGGTTGGTG ATCCCGCTGC TGCCCTTCTA CGCCAAGTCG CTGAACGCCA GCCCGTGGCA GGTGACGGCG CTGTTTTCGG CCTATTCGCT GGGTCAGTTC GTCGCCGAGC CGTTCTGGGG CCGGCTTAGC GACCGCATCG GCCGGCGACC GGTGCTGATC GTCACCATCC TGGCCAACAC CGTCTCCTAT GTGGCCCTGG CCTTCGCCCC AAACATCGCC TGGGCGTTCG CCATCCGCCT GGCCAGCGGT TTCGGCAGCG GCAACATCTC GACCATCCAG GGCTACATGG CCGACGTCAC CCCGCCCGAG AAGCGGGCCG GACGCATGGG CCTGCTGGGC GCGGCGTTCG GCATGGGCTT CGTCGTCGGC CCCACCCTGG GCGGCCTGCT GCCCGGCCTC GCCAAGGTCT TCGGCCATTC CGACACCGGC CGCCTGGCCT TCCAGATCCC GCTGCTGACC GCCGCTGTCC TGGCCGCCAT CGCCTCGCTG GGGGTGTTCC TGTTCGTGGT CGAGAGCCGC GCGCCCAGCG CCAAGGACGC GCCCCGGCCG CACCGCCGCG AGGCCCTGGA GATGGCGCGC GCCCACCCCG TGCTGTCGCG GGTGCTGCTG GTCACCCTGA TCTCGACCGC CGCCTTCGCC GGCATGGAGG CGGTCTTCGG CCTGTGGACC CAGGCCCGGT TCGACTGGGG ACCCAGGCAG GTCGGCCTGT GCTTCGCGGT GATCGGGATC ATCGCCTCGA TCGGCCAGGG CCTGATCACC GGTCGGCTGG CGCGCCGCTT CGGCGAGGCC AAGGTGCTGA CCGCAGGCCT GTCGATCATC GCCGTCAGCC TGGCCCTGAC GCCGTTCGTG CCGACCAGCG CCTTCGTGCC GGTGGTCGTG GGCTGCACGG CGTTCGGCCA GTCGCTGGTG TTTCCCTGCG TCGCCGCCCT GATCTCGCGC GCCACCCCGC CCGACAAGCA GGGCGCCATG CTGGGCCTGA ACATGGCCGC GGGCTCGCTG GCCCGCATGG CCGGCCCGAT GCTGGCCGGC CCGCTGTTCG GCCTGGCGAT CGGCGGCCCC TACTGGCTGG GAGCCGTCTT GATGATCCCC GCCATCGCCT TCGCCCTGAC GATCGAGCAC CGGGCCAAGG CGGCGGCGTA G
|
Protein sequence | MTTLSSPTTP NASTTPDASP RALYVLLLVV FINLVGFGLV IPLLPFYAKS LNASPWQVTA LFSAYSLGQF VAEPFWGRLS DRIGRRPVLI VTILANTVSY VALAFAPNIA WAFAIRLASG FGSGNISTIQ GYMADVTPPE KRAGRMGLLG AAFGMGFVVG PTLGGLLPGL AKVFGHSDTG RLAFQIPLLT AAVLAAIASL GVFLFVVESR APSAKDAPRP HRREALEMAR AHPVLSRVLL VTLISTAAFA GMEAVFGLWT QARFDWGPRQ VGLCFAVIGI IASIGQGLIT GRLARRFGEA KVLTAGLSII AVSLALTPFV PTSAFVPVVV GCTAFGQSLV FPCVAALISR ATPPDKQGAM LGLNMAAGSL ARMAGPMLAG PLFGLAIGGP YWLGAVLMIP AIAFALTIEH RAKAAA
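The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. As a minimal sketch, the helpers below strip the block spacing and compute a GC fraction of the kind reported in the GC content row. The helper names are hypothetical, and the example runs only on the first three blocks of the gene sequence, so its result is not expected to equal the 71% reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, copied from the record above.
prefix = clean_sequence("ATGACCACGC TTTCCAGTCC CACGACTCCG")
print(len(prefix))            # 30
print(prefix.startswith("ATG"))  # True: the CDS begins with a start codon
print(round(gc_fraction(prefix), 2))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give the value to compare against the GC content field.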