Gene Caul_0868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0868 
Symbol 
ID5898323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp924629 
End bp925957 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content70% 
IMG OID641561351 
Productmajor facilitator transporter 
Protein accessionYP_001682497 
Protein GI167644834 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.201883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGGA CCGGCGCTGA CGTCAGCCTC AGCGAACGCG ACAAGGGCCG GGCCTTCGCC 
ATCCTGTTCG CCGTGCTGCT GTCCAGCGCC GCGGGCAACA CCGCCCTTCA AACCGTCCTG
CCCGCCATTG GCCGCCAGGT CGGCATTCCC GACGTGCTGA TCAGCTCGAT CTTCTCGCTG
TCGGCCCTGC TGTGGGGCGT GATGTCGCCG GTCTGGGCGC GGATGTCCGA CAAGCACGGC
CGCAAGCCGA TGGTGGTGCT GGGCATGGCC GGCTTCGCGG TCTCTATGCT GGGCTTTGGC
TTCTTCATCT TCATGGGGCT CAAGGGCCTG ATGGTCCCGC TGGCGGTGTT CGCCGGAGCG
ACCCTCTCGC GGGCGATCTT CGGCCTGGTC GGCTCGGCCT CGAACCCGGC GGCCCAGGCC
TATGTCGCCG ACCGCACCGC CCCCGTCGAC CGCACCAACG CCCTGTCGAC CATGGCCTCG
GCCAGCGGGC TGGGCACGAT CCTCGGCCCG GCCGTGGCGC CGTTCCTGGT CTTTCCGCTG
CTGACCCTGT CGGGGCCGAT GTTCGCGTTC GCGGCCATCG CCGTGGTGGT GCTGGTGCTG
GTGATCCGCG GCCTGCCCGA ACGGCCCGAC GAGATCCCCG ACCGCGAGGG CGACAGGGCC
AGGCAGCCAC GAGCGAGGGT GCGCTGGAAC GACCGGCGGA TCATGCCCTT CATCCTCTAC
GGCTTCCTGC TGGCCAGCGC CCAGACCGTG AACCAGCAGA CCCTGGGCTT CATGGTCATC
GACAAGCTGA ACATCTCGCC GGCCAAGGCC GCCGCCTTCG CGGGCGTGGC GATGATGGCC
GGCGCCGTGG CCAGCCTGCT GGCCCAATGG GGCCTGATCC GCATGCTGCG CCTGACCCCG
CGCATGCTGC TGTGGCTGGG CGCGGGCTGC GCGGCCGTGG GCAACCTGAT CGTCGCCTTC
TCGCCGGACT ACCACACCCT GGTCGTCGGC TTCGCCCTGT GCAGCCTCGG CTATGGCTTC
GCCCGCCCGG GCTTCACGGC TGGCGCCTCG CTGTCGGTGG GCCACGAGGA GCAGGGGGCC
GTGGCCGGCG CGATCAGCGC CATCAACGGC GCCTCGGTGA TCATCGCCCC GGTGCTGGGC
GTGGCGCTCT ACAAGTGGGC CCATCCCTCG CCCTACCTGA TGAACGTCGC GATCCTGGCG
GGCCTGGCCA TCTACGCCCT GCTAAATCCC GTCATGCGCC GCGTGGGCGA CGCCGAGCAG
GCCCGGGAAC GCCGCGACGA AAGCCAGGTC GTCGACGCCA GCTCGATCGA CGCGACGGGA
CCGCACTAG
 
Protein sequence
MTGTGADVSL SERDKGRAFA ILFAVLLSSA AGNTALQTVL PAIGRQVGIP DVLISSIFSL 
SALLWGVMSP VWARMSDKHG RKPMVVLGMA GFAVSMLGFG FFIFMGLKGL MVPLAVFAGA
TLSRAIFGLV GSASNPAAQA YVADRTAPVD RTNALSTMAS ASGLGTILGP AVAPFLVFPL
LTLSGPMFAF AAIAVVVLVL VIRGLPERPD EIPDREGDRA RQPRARVRWN DRRIMPFILY
GFLLASAQTV NQQTLGFMVI DKLNISPAKA AAFAGVAMMA GAVASLLAQW GLIRMLRLTP
RMLLWLGAGC AAVGNLIVAF SPDYHTLVVG FALCSLGYGF ARPGFTAGAS LSVGHEEQGA
VAGAISAING ASVIIAPVLG VALYKWAHPS PYLMNVAILA GLAIYALLNP VMRRVGDAEQ
ARERRDESQV VDASSIDATG PH