Gene Caul_4020 details

Gene Information

Locus tag: Caul_4020
Symbol:
ID: 5901482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4354671
End bp: 4356113
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 66%
IMG OID: 641564541
Product: sugar transporter
Protein accession: YP_001685643
Protein GI: 167647980
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00879] MFS transporter, sugar porter (SP) family
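
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4354671
END_BP = 4356113
GENE_LENGTH_BP = 1443
PROTEIN_LENGTH_AA = 480


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus one stop codon.

    Assumes the CDS length includes the stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    computed_bp = span_length(START_BP, END_BP)
    computed_aa = expected_protein_length(computed_bp)
    print("gene length consistent:", computed_bp == GENE_LENGTH_BP)
    print("protein length consistent:", computed_aa == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: the span covers 1443 bp, which is 481 codons, or 480 residues once the stop codon is excluded.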


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.997993
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCATCTG TATCCAACGC CGGGCCCAGC CCGGGGATGA GCGCCGACGG CGCGAAGGTC 
AACATGGCCT TCATCGCCGC CATCGTGGCC GTCGCCACCA TCGGCGGCTT CATGTTCGGC
TACGACAGCG GCGTCATCAA CGGCACGCAG GAAGGCCTCG AGAGCGCCTT CAACCTCAGC
AAGCTGGGCA CCGGCCTGAA CGTCGGCGCG ATCCTGATCG GCTGCGCGTT CGGCGCCTTC
GCGGCCGGCC GCCTGGCCGA CGTCTGGGGC CGCCGCACGG TGATGATCAT CGCCGCCCTG
CTGTTCCTGG TCAGCGCCAT CGGCTCGGGC GCCGCCCACA CCTCCATGGT GTTCATTTTC
TTCCGCCTGA TCGGCGGCCT GGGCGTGGGC GCGGCCAGCG TGCTTTGCCC GGTCTACATC
TCGGAAGTGA CGCCGGCCAA CATCCGCGGC CGGCTCTCAT CCGTGCAGCA GATCATGATC
ATCACCGGCC TGACCGGCGC GTTCGTGGCC AACTACATCC TGGCCCACAC CGCCGGCAGC
TCGACGGCGA TCTTCTGGAT GGGCTTCCCG GCCTGGCGTT GGATGTTCTG GATGCAGACG
ATTCCCGCCG CGATCTTCTT CTTCAGCCTG CTGTCGATCC CGGAAAGCCC CCGCTACCTG
GTGGCCAAGG GCAAGGACGC CGAGGCCTCG GCGATCCTCT CGCGCCTGTT CGGCCAGGGT
GAGGGCGACA AGAAGGTGGC CGAGATCCGC GCCTCCCTGG CCGCCGACCA TCACAAGCCC
AAGATGAGCG ACCTGATCGA CCCGATCACC AAGAAGCTGC GCCCGATCGT CTGGACCGGC
ATCGGCCTGG CCGTCTTCCA GCAGTTGGTC GGCATCAACA TCGTCTTCTA CTACGGCTCG
GTGCTGTGGC AGTCGGTGGG CTTCTCGGAA GACGACAGCC TGAAGATCAA CATCCTGTCG
GGGTCGCTGT CGATCCTGGC CTGCCTGCTG GCCATCGCCC TGATCGACAA GATCGGTCGC
AAGCCGCTGC TGCTGATCGG CTCGGCCGGC ATGGCCGTCA CCCTGGGCAC GGTGGGCTAC
TGCTTCTTCC AAGGCTCGAT GGTCAACGGC GCGCTCAGCC TGCCGGGCAA TTTCGGCCTG
ATCGCCCTGA TCGCCGCCAA CGCCTATGTG GTGTTCTTCA ACCTCTCATG GGGTCCGGTC
ATGTGGGTCA TGCTGGGCGA GATGTTCCCC AACCAGATCC GCGGCTCGGG CCTGGCCGTC
GCCGGCTTCG CCCAGTGGAT CGCCAACTTC GGCATCTCGG TCAGCTTCCC GGCCATGGCC
GCGGGCCTGG GCCTGCCGGT CACCTACGGC TTCTATGCCC TGAGCGCCCT GATCTCGTTC
TTCTTCGTCC AGAAGATGGT TCGCGAGACC CGTGGGCAAG AGCTGGAAGA CATGGTGGGG
TAG
 
Protein sequence
MASVSNAGPS PGMSADGAKV NMAFIAAIVA VATIGGFMFG YDSGVINGTQ EGLESAFNLS 
KLGTGLNVGA ILIGCAFGAF AAGRLADVWG RRTVMIIAAL LFLVSAIGSG AAHTSMVFIF
FRLIGGLGVG AASVLCPVYI SEVTPANIRG RLSSVQQIMI ITGLTGAFVA NYILAHTAGS
STAIFWMGFP AWRWMFWMQT IPAAIFFFSL LSIPESPRYL VAKGKDAEAS AILSRLFGQG
EGDKKVAEIR ASLAADHHKP KMSDLIDPIT KKLRPIVWTG IGLAVFQQLV GINIVFYYGS
VLWQSVGFSE DDSLKINILS GSLSILACLL AIALIDKIGR KPLLLIGSAG MAVTLGTVGY
CFFQGSMVNG ALSLPGNFGL IALIAANAYV VFFNLSWGPV MWVMLGEMFP NQIRGSGLAV
AGFAQWIANF GISVSFPAMA AGLGLPVTYG FYALSALISF FFVQKMVRET RGQELEDMVG