Gene Caul_4768 details

Gene Information

Locus tag: Caul_4768
Symbol:
ID: 5902230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5152528
End bp: 5153772
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 74%
IMG OID: 641565288
Product: major facilitator transporter
Protein accession: YP_001686386
Protein GI: 167648723
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
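
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5152528, 5153772)
print(length)                  # 1245
print(protein_length(length))  # 414
```

Both values match the Gene Length and Protein Length fields reported for Caul_4768.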


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.24612
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTATCG AGAAGGCGGC GATCATGACC TCCAGTCCAG TTCCCGACGC GGTCGCGCCG 
GCCAAGGTCC ATGCGGCGGG TCTGGTGCTG GCGGCCCTGG CCCTGGGCGG CTTCGCGATC
GGCACCACCG AGTTCGCCTC GATGAGCCTG CTGCCCTATT TCGCCGCCAG CCTGGGCGTC
GACGCCCCGA CCGCCGGCCA CGCGATCAGC GCCTACGCCC TGGGGGTCGT GATCGGGGCG
CCGATCATCG CCGTGGCGGC CGCGCGCCTG CCGCGCCGGC TGATCCTGGT GGCGCTGATG
GCGGTGTTCG CGGTCGGCAA CCTGCTCAGC GCCCTGTCGC CGAGCTTCGG CTGGATGTTG
GTCTTCCGGT TCCTCAGCGG CCTGCCGCAC GGCGCCTATT TCGGCGTCGC GGCCCTGGTC
GCCGCTGGCG TCTCGCCGCC CGAGCGCCGG GCGCGGGCGG TGGCCATGGT GATGATCGGC
CTGACCGTGG CGACCATCGT CGGCGTGCCG CTGGCCAATG TCGTGGGCCA ATGGATCGGC
TGGCGCTGGG GCTTCGTGAT CGTCGCGGCG CTGGCCATGA TGACCGCCAC GGCGGTCTGG
CTGCTGGCGC CGCGCGACGC GGCCCATCCC GACGCCTCGC CGCTGCGCGA GCTGGGCGCC
CTGGGTCGGG GACGGGTGTG GCTGACCCTG GGGATCGGGG CGATCGGCTT TGGCGGCATG
TTCTGCGTCT ACACCTACCT GGCCTCGACC ATGGCCGAGG TCACCCACGC CTCGCCAGCC
GCCCTGCCGA TGGTGCTGGC GGTGTTCGGG GCCGGCATGA CCGTCGGCAC CCTGGTCTGC
GCCTGGGCCG CCGACCGCGC CCAGATGCCG GCCATCGGCG GCGTGCTACT GTGGAGCGCC
GCGGCCCTGG CTCTCTATCC GATGGCGACG GGCAGCCTGT GGACCCTGGC CCCGGTGGTG
TTCCTGATCG GTTGCGGCGG CGGACTGGGC GCGGTGCTGC AGACCCGGCT GATGGACGTG
GCCGGCGACG CCCAAACCCT GGCGGCGGCG CTGAACCACT CGGCCTTCAA CTTCGCCAAC
GCCCTGGGAC CGTGGCTGGG CGGCCTGGCC ATCGCCGCGG GCTACGGCTG GGCCTCGACC
GGCTATGTCG GCGCGGCCCT GGCCCTGGGC GGCTTCGCGA TCTGGATCGT GGCGGCGGTC
GATGTGCGGC GGACGCGGCG GGCGACGCTG GCGCCGGCGG AGTAG
 
Protein sequence
MRIEKAAIMT SSPVPDAVAP AKVHAAGLVL AALALGGFAI GTTEFASMSL LPYFAASLGV 
DAPTAGHAIS AYALGVVIGA PIIAVAAARL PRRLILVALM AVFAVGNLLS ALSPSFGWML
VFRFLSGLPH GAYFGVAALV AAGVSPPERR ARAVAMVMIG LTVATIVGVP LANVVGQWIG
WRWGFVIVAA LAMMTATAVW LLAPRDAAHP DASPLRELGA LGRGRVWLTL GIGAIGFGGM
FCVYTYLAST MAEVTHASPA ALPMVLAVFG AGMTVGTLVC AWAADRAQMP AIGGVLLWSA
AALALYPMAT GSLWTLAPVV FLIGCGGGLG AVLQTRLMDV AGDAQTLAAA LNHSAFNFAN
ALGPWLGGLA IAAGYGWAST GYVGAALALG GFAIWIVAAV DVRRTRRATL APAE