Gene Caul_4087 details


Gene Information

Locus tag: Caul_4087
Symbol:
ID: 5901549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4433805
End bp: 4435349
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 68%
IMG OID: 641564607
Product: major facilitator transporter
Protein accession: YP_001685709
Protein GI: 167648046
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID:
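
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon. Both are assumptions, although they agree with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4433805, 4435349)
print(length, protein_length(length))
```

With the values from this record, the sketch gives 1545 bp and 514 aa, matching the Gene Length and Protein Length fields.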


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGG CGCGGCTGAA TTTCCTCCAG ATCTGGAACA TGTGCTTTGG ATTTTTCGGG 
ATCCAGATCG GTTTTGGCCT GCAGAACGCC AACACCAGCC GCATCTTCCA AAGCCTGGGC
GTCGATGTGG ATCACCTGGC GATCCTGTGG ATCGCCGCGC CGATGACCGG CCTGCTGGTC
CAGCCGATCA TCGGCTATCT GAGCGACAAG ACCTGGGGGC GCCTGGGCCG TCGCCGACCG
TATTTCTTCT GGGGCGCGAT CCTGACCAGC GCAGCCCTGC TGGTCATGCC CAACGCCCCG
GCGCTGTGGG TGGCCGCCGC GGCGCTGTGG ATCATGGACG CCTCGATCAA CATCACCATG
GAGCCGTTCC GGGCCTTCGT CGGCGACAAC CTGCCCGACG AACAGCGCGC CCAGGGCTAC
GCCATGCAGA GCTTCTTCAT CGGGCTGGGC GCGGTGCTGG CCTCGGCCCT GCCGTGGATG
CTGACCCACT GGTTCTCGGT CAGCAACGTG CCGGCCAGTG GCGGCGGCGT CCCGCCCTCG
GTGCATATCG CCTTCTATGT CGGCGCCGCC GGCCTGTTGC TGTCGGTGCT GTGGACGGTG
TTCACCACCC GGGAATACAG CCCCGACCAG CTCGCCGCCT TCGAGGTCGC CGAGCGGGCG
CGCAAGGGCG AGCCGCCGGT CGCGCCCGAG CCGCCGACCC GCGCCGCGCG GGCCTATCTG
ATCGGCGGGG CCGCCTGTGT CCTGGCCGGT CTGGCGCTGA CGGCCGTGAT CCTGGCTACC
CGGGCCGAGA AGGAGCTCTA CGTCCTGGCC GGCATGGCCG TCGCCTTCGG CCTGGCCCAG
CTGATCGCCG GCCTGCTGCG CGCGCGCGGA ACCATCGCCA ACGGCTTTTC GGAGGTGGTC
GAGGATCTGT TCCGCATGCC CGCCACCATG AAGCAGCTGG CGGTGGTGCA GTTCTTCTCG
TGGTTTGGCC TGTTCGCCAT GTGGATCTAC ACGACCCCCG CCGTGGCGGC GTTCCACTAC
CACGCCCTCG ACACCGCCTC GAAGGCCTAC AATGACGGCG CCGACTGGGT GGGCGTGCTG
TTCGCCATCT ATAACGGCGT GGCGGCCCTG GCGGCCCTGC TGATCCCGCT GATCGCCCGG
GCCACCAGCC GCAAGGTCAG CCACGCCCTG TGCCTGGGCC TGGGCGGGCT GGGCCTGCTG
TCCTTCCCGC TGATCCGGGA GCCGGCCCTG CTGTGGATCC CGATGATCGG CGTGGGGTTC
GCCTGGTCGT CGATCCTGTC GGCCCCCTAT TCGATCCTGT CGGGGGCGCT GCCGGCCCGG
AAGATGGGCG TCTATATGGG CATCTTCAAC TTCTTCATCG TCATCCCGCA ACTGCTGGCC
GCCACTGTGC TGGGCGTGCT GCTGCGGACC TTCTTCGGTG GCGAGGCCAT CTGGGCCCTG
GTGCTGGGGG CCGGCGGGAT GTTCGCCGCC GCCCTCTGTG TCTTCGCCGT TCGCGACCTG
GGCGAGCCCA GGGCCTTGGC CCTCGCCACA CCCGTTCAAG CCTGA
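
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal Python sketch of how a GC fraction could be computed from such text. The helper names are hypothetical, and the sketch assumes the GC content field is simply the fraction of G and C bases over the whole gene. Only the first row is used here for brevity; the full sequence would be pasted in the same way.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_content(blocked: str) -> float:
    """Fraction of bases that are G or C (assumed definition of 'GC content')."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above, copied verbatim.
first_row = "ATGACCAAGG CGCGGCTGAA TTTCCTCCAG ATCTGGAACA TGTGCTTTGG ATTTTTCGGG"
print(f"{gc_content(first_row):.0%}")
```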
 
Protein sequence
MTKARLNFLQ IWNMCFGFFG IQIGFGLQNA NTSRIFQSLG VDVDHLAILW IAAPMTGLLV 
QPIIGYLSDK TWGRLGRRRP YFFWGAILTS AALLVMPNAP ALWVAAAALW IMDASINITM
EPFRAFVGDN LPDEQRAQGY AMQSFFIGLG AVLASALPWM LTHWFSVSNV PASGGGVPPS
VHIAFYVGAA GLLLSVLWTV FTTREYSPDQ LAAFEVAERA RKGEPPVAPE PPTRAARAYL
IGGAACVLAG LALTAVILAT RAEKELYVLA GMAVAFGLAQ LIAGLLRARG TIANGFSEVV
EDLFRMPATM KQLAVVQFFS WFGLFAMWIY TTPAVAAFHY HALDTASKAY NDGADWVGVL
FAIYNGVAAL AALLIPLIAR ATSRKVSHAL CLGLGGLGLL SFPLIREPAL LWIPMIGVGF
AWSSILSAPY SILSGALPAR KMGVYMGIFN FFIVIPQLLA ATVLGVLLRT FFGGEAIWAL
VLGAGGMFAA ALCVFAVRDL GEPRALALAT PVQA
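
The protein sequence can be related back to the gene sequence by translating codons. The following is a minimal Python sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not handle). The names `CODON_TABLE` and `translate` are illustrative, and ambiguous bases such as N would raise a `KeyError`.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a blocked DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence from this record, copied verbatim.
first_row = "ATGACCAAGG CGCGGCTGAA TTTCCTCCAG ATCTGGAACA TGTGCTTTGG ATTTTTCGGG"
print(translate(first_row))
```

The first 60 bases translate to MTKARLNFLQIWNMCFGFFG, which matches the first two blocks of the protein sequence listed above.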