Gene Caul_1551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1551 
Symbol 
ID5899006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1640760 
End bp1642064 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content66% 
IMG OID641562039 
ProductL-fucose transporter 
Protein accessionYP_001683179 
Protein GI167645516 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID[TIGR00885] L-fucose:H+ symporter permease 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAA GCCCGCAGGG AAGGACCACG TTCGCGCCGC TGGTTCTCAT CGTCGCCCTC 
TTCTTCCTTT GGGGCATCGC CAACAATCTC AACGACGTCC TAATTCCCCA TTTGAAGAAG
GCTTTCTTTC TCACCGACCT GCAGTCCGGC CTGGTGCAAT CGGCCTTCTA TCTTGGCTAT
TTCTTCCTGG CCCTGCCGGC CAGCGTCGTC ATGCGACGCC ACGGCTACAA GGCGGCGGTG
ATCGTCGGTC TGCTGCTGTT TGGTCTGGGC GCCTTGTTGT TCTATCCCGC CGCCGAGGCG
CGGCAGTATT CCTGGTTCCT GGCCGCCCTG TTCGTCCTGG CCTCGGGCCT GGCTTTCCTG
GAGACCTCGG CCAATCCGCT GATCACGGTG CTGGGCGATC CGGCCAAGGC CGAGCAGCGC
CTCAACTTCG CGCAGGCCTT CAATCCGCTG GGCTCGATCA CCGCCGTGGT GGTGGGGCGC
CAGTTCATCC TGTCGGGCGT GGAGCCGACG AAAGCGCAGT TCGCCGCCAT GACGCCGGCG
CAACTTCAGG CCTTCCAGAC CACCGAGGCC CAATCCACCC AGATTCCCTA TCTGATCATC
GCCGCCGTTG TGCTGGCCTG GGCGCTGCTC GTGGTCGTCA CCAAATTCCC CCGCCAGGCC
GGACGCCCGG ACCCAAACGA GGCCGACGCC GCCCTGCCCG CCGCCCAGGC CGTTCCCGCC
CTGCTGGCGC GACCGCGGTT CCTGTTCGGC GTGGCGGCCC AGTTCTTCTA CGTCGGCGCC
CAGGTCGGCG TCTGGAGCTA CATGATCCGC TACGCCCAGC ACGAGGTTCC GGGCATGGGC
GAGAAGACGG CGGCGGCCTA CCTGTCATGG TCCCTGGTCG GGTTCATGGC CGGACGTTTC
ATCGGTACGG CCGCGATGAG CCGGGTCAGC CCCTCGCTGA TGATGGGCGT GTTCGCCATG
ATCAATGTCG GCCTGACCCT GGTCGCGGTC GTCGCAGGCG GAAAGGTCGG GCTGTACGCC
CTGGCCGCCA CCAGCGTCTT CATGTCGATC ATGTTCCCCA CCATCTTCGC CGCCTCGCTG
AAGGGGCTAG GACCGCTGAC CAAGACCGGT TCATCCTTCC TGGTGATGAG TATCATCGGC
GGCGCGGTCC TGACGGCGGT GATGGGGGGC GTCTCGGACG CCAGCGCCAT CAACGTCGCC
ATCCTGGTGC CCTGCGCCTG CTTCGCTGTG GTCGGCCTGT TCGGTTTCAC TGCTGGCCGC
ACGGCCAGCC AGGACCTGAA GACCGCTCCC GTCGGAGCGC ACTAG
 
Protein sequence
MEKSPQGRTT FAPLVLIVAL FFLWGIANNL NDVLIPHLKK AFFLTDLQSG LVQSAFYLGY 
FFLALPASVV MRRHGYKAAV IVGLLLFGLG ALLFYPAAEA RQYSWFLAAL FVLASGLAFL
ETSANPLITV LGDPAKAEQR LNFAQAFNPL GSITAVVVGR QFILSGVEPT KAQFAAMTPA
QLQAFQTTEA QSTQIPYLII AAVVLAWALL VVVTKFPRQA GRPDPNEADA ALPAAQAVPA
LLARPRFLFG VAAQFFYVGA QVGVWSYMIR YAQHEVPGMG EKTAAAYLSW SLVGFMAGRF
IGTAAMSRVS PSLMMGVFAM INVGLTLVAV VAGGKVGLYA LAATSVFMSI MFPTIFAASL
KGLGPLTKTG SSFLVMSIIG GAVLTAVMGG VSDASAINVA ILVPCACFAV VGLFGFTAGR
TASQDLKTAP VGAH