Gene Caul_1905 details


Gene Information

Locus tag: Caul_1905
Symbol:
ID: 5899360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2042734
End bp: 2044014
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 69%
IMG OID: 641562395
Product: major facilitator transporter
Protein accession: YP_001683532
Protein GI: 167645869
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
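
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which the following minimal Python sketch illustrates. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither is stated in the record, but both agree with its values. The function names are hypothetical.

```python
def cds_length(start_bp, end_bp):
    # Assumption: 1-based, inclusive coordinates, so both end points count.
    return end_bp - start_bp + 1

def protein_length(gene_length_bp):
    # Assumption: the final codon is a stop codon and yields no residue.
    return gene_length_bp // 3 - 1

gene_len = cds_length(2042734, 2044014)
print(gene_len)                  # 1281
print(protein_length(gene_len))  # 426
```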


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGACCT CCTCGTCCCC CGCCCTCGCC AAGGGCGCCT GGTACACCCT GGTCATCCTG 
ACGCTCGTCT ATGTCTCGAA CTCGATCGAC AGGACGGCGA TGTCGATCCT CATCGAACCG
GTGAAGGCGG AGTTCAAGCT TTCGGACAGC CAGCTGGGCC TGCTGACGGG CCTCGCGTTC
GGCCTGACCT ATGCCCTGGC GGGCCTGCCG TTGGGGTGGC TGATCGACCG GGTGAACCGC
ACCAGGCTGC TCGCGGCGGT GGTCGCGATC TGGAGCCTTT GCACCGCCGT CTGCGGCCTC
GCCCAAAGCT ATCCGGCGCT GGTGATGGCG CGGCTGGCGG TCGGCGCGTC GGAATCGGCG
GCGGCGCCCA CGGCGATGTC GATGATCGCG GACCTCTTTC CCAAGAACCG GCGCTCGACG
GCCATGGGCG TGTTCTGGAC CAGCACGGCT TTCGGCACGG CCATCAGCCT CGTGCTCGGC
GGCGTGATCG CCGCCAACTA CGGGTGGCGC GCGGCCTTCT TCGTCGCCGG CGTTCCTGGA
CTGATCCTGG CCGTCCTGAT CATCCTGACC GTCCGTGAAC CCGCCCGCGA GCGCGATCTC
GGCCAAGGCG ACGCCGGGCC GGCGCCGTCG CTGTTTCAGA CGCTGCGGTT CGTCTGCGCT
AATCCGACGG TCTTCCACGC CTTCGTCGGC ATAGGGCTGG CCTCATTGGC CATGTCGGGC
GTTCCGGTAT GGGCCGCGTC CTTTCTGGTC CGCACCCAGG GCTTCACCCT GCCGCAGGCC
GGCCTGATGG CGGGCCTCGG CGTCGGGCTC TTCGGCGCGC TGGGATCGCT CATGGGCGGT
CCGGTCGGCG ACGCCGTGGT TCGTCGTTGG GGCGTCCAGG CCTTGCCGGC CGCGCCGATG
GTCGCCTGCG TTCTGGCCTG CGCTTCGGGT CTTGTCTTCG CCCTGGGGTC GTCCCTCGCG
GTCGTGGCCC TTGGCTTTAT CGTCTTCGAG ATCGTCTCGC GCGGCTTTAC CGCTCCGGCC
TATGCGATCC TCGTCACCGG CGTGGAGCCG CGCATGCGAG GCGTCGTCGT GTCGGCGGTC
CAAGCCGTGA CCAATCTCAT CGGTTACGGC GTTGGCCCCC TGGTCGTGGG CGTAGTCAGC
GACCGCGTCG GGGGAACCCA CTCCCTTAAG GCCGGCATCG CCGCGGTGAT GATCTTCAGC
CTATGGTCGG GCCTGCATTT CTTCGCCGCT TGGGCCGCGG CGCGCCGCTC GGAGCGTTTC
GCCGGCGGAG CAACCGCATG A
 
Protein sequence
MQTSSSPALA KGAWYTLVIL TLVYVSNSID RTAMSILIEP VKAEFKLSDS QLGLLTGLAF 
GLTYALAGLP LGWLIDRVNR TRLLAAVVAI WSLCTAVCGL AQSYPALVMA RLAVGASESA
AAPTAMSMIA DLFPKNRRST AMGVFWTSTA FGTAISLVLG GVIAANYGWR AAFFVAGVPG
LILAVLIILT VREPARERDL GQGDAGPAPS LFQTLRFVCA NPTVFHAFVG IGLASLAMSG
VPVWAASFLV RTQGFTLPQA GLMAGLGVGL FGALGSLMGG PVGDAVVRRW GVQALPAAPM
VACVLACASG LVFALGSSLA VVALGFIVFE IVSRGFTAPA YAILVTGVEP RMRGVVVSAV
QAVTNLIGYG VGPLVVGVVS DRVGGTHSLK AGIAAVMIFS LWSGLHFFAA WAAARRSERF
AGGATA
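
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The sketch below is a minimal, self-contained Python illustration, not the method used by the database. It assumes that table 11 shares its amino-acid assignments with the standard code and that an initiator codon in first position (here GTG) is read as methionine. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# used here for translation table 11 are those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: initiator codons listed for table 11, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

def clean(seq):
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    return "".join(seq.split()).upper()

def translate(seq):
    # Translate codon by codon, stopping at the first stop codon.
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(seq):
    # Fraction of G and C bases in the sequence.
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

first_block = "GTGCAGACCT CCTCGTCCCC CGCCCTCGCC"
print(translate(first_block))  # MQTSSSPALA, the first ten residues of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the Protein sequence and GC content fields of the record.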