Gene Caul_2862 details


Gene Information

Locus tag: Caul_2862
Symbol:
ID: 5900317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3108039
End bp: 3109385
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 65%
IMG OID: 641563359
Product: major facilitator transporter
Protein accession: YP_001684487
Protein GI: 167646824
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
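
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states. The function names are hypothetical.

```python
# Coordinates copied from the record above.
START_BP = 3108039
END_BP = 3109385


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: one residue per codon, minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the computed values agree with the record: 1347 bp and 448 aa.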


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.114949
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0171068
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGA AGATGTCCGA AGACGCGAAC GTCGAGATGG CGCCAACAGT CGAAACGCCC 
CGAAGGCAAA CCGCCGCCCC CAGCGGCGCG CGGATCACGC TGGCGATGCT GTGCTTCGTC
TATGTGCTCA ATTTCCTGGA TCGCCAGCTC ATCTCGATCC TGGCCAAGCC GATTCAGGAC
GGCTTGAAGA TCAGCGACGG CCAGCTGGGC CTGCTGACCG GCTTCTATTT CGCGCTCTTC
TATTGCTTCA TCGCCATTCC GATCGGCTGG CTGGCCGATC GCACCAGCCG CGTCAGGGTT
CTGGCCATCG CCTGCGCGCT ATGGAGCGGC GCGACCGCCG CGTGCGGACT GGTCGGCAAC
TATGGACAGC TCGTCGTCGC CCGCATGATG GTCGGCGTTG GGGAGGCTGG CGGCGTGCCG
CCGTCCTACG CCATCATCTC CGACTCCTTC CCGCGCGAGC GGCGGACCAC GGCCATGGCG
ATCTTCAATC TTGGCCCGCC GATCGGTTCG GCGCTGGGCA TCACCTTCGG CGCGTCGCTC
GCTTCCGCCT TCAGCTGGCG CATCCCCTTC TACGTGATCG GCGTGATCGG CGTGGTCGCC
GCGGTGGCCG TTCATCTGAT CGTTCGCGAG CCGAAGCGCG GACAGATGGA CAGTCCGGAG
CGTTCGATGC GCAAGGACGC GGCGGGCGCC GGGCTGATCG CGACCATCAC CCAATTCTTC
AGCAACCCCC TGCTGCTCAT GGCGTCGCTG GCCAGCGGCG CCGGCAACTT CATCACCTAC
GGCCTGCTCA ATTTCACGAC GCTGTTCCTC ATGCGCGAAA AGGGCATGCA ACTGGCCGAC
GTCGCGATCT GGTATGCCCT GGTCGTCGGG ATCGGCATGA GCGCCGGCAT CTACGCCTCG
GGGCGGATCG TCGATCGGTT CGCCGCGCGC AGCAAGACCG CCTATGCAAT CGTCCCGGCC
GCCTCGCTGC TGCTCGCCCT GCCCTTCTTT CTGGGCTTCG CCTGGGCTCC GACCTGGCAG
TTGTCTTTGC TCTTCCTGCT TGTTCCCATG TTCCTCAATT CGTTCTTCCT GTCGGCCACC
GTCACCTTCG TCCAGAGCGA GGTCCCTGCC GAACGGCGGG TGATTTCCGG CGCGCTGCTG
CTGCTGGTGA TGAACTTCAT CGGCCTGGGC CTTGGACCGA CCTATGTCGG CATGGCCAGC
GACTACTTCC GGCCGGTCCA CGGCGCCCAT GCCCTGCGGG CCGCCTACTA TGCGCTCGCC
CCGATGTACC TGATCGGGGC CGCGCTGTTC CTGGTCCTCG CGCGCCTCAT CCGCCGCGAC
GAACGCCTCC ACGAAGGAGC CCTCTGA
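
The gene sequence is displayed in space-separated blocks of ten bases, so it needs cleaning before its length or GC content can be compared with the values in the record. The following is a minimal sketch, not the database's own method; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed row is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a displayed sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First displayed row of the gene sequence, copied verbatim.
first_row = "ATGAAAATGA AGATGTCCGA AGACGCGAAC GTCGAGATGG CGCCAACAGT CGAAACGCCC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Running the same two functions on the full sequence should give a length equal to the listed gene length and a GC fraction close to the listed GC content.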
 
Protein sequence
MKMKMSEDAN VEMAPTVETP RRQTAAPSGA RITLAMLCFV YVLNFLDRQL ISILAKPIQD 
GLKISDGQLG LLTGFYFALF YCFIAIPIGW LADRTSRVRV LAIACALWSG ATAACGLVGN
YGQLVVARMM VGVGEAGGVP PSYAIISDSF PRERRTTAMA IFNLGPPIGS ALGITFGASL
ASAFSWRIPF YVIGVIGVVA AVAVHLIVRE PKRGQMDSPE RSMRKDAAGA GLIATITQFF
SNPLLLMASL ASGAGNFITY GLLNFTTLFL MREKGMQLAD VAIWYALVVG IGMSAGIYAS
GRIVDRFAAR SKTAYAIVPA ASLLLALPFF LGFAWAPTWQ LSLLFLLVPM FLNSFFLSAT
VTFVQSEVPA ERRVISGALL LLVMNFIGLG LGPTYVGMAS DYFRPVHGAH ALRAAYYALA
PMYLIGAALF LVLARLIRRD ERLHEGAL
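
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below assumes the amino acid assignments of NCBI translation table 11, which match the standard code. It does not handle alternative start codons, which is harmless here because the gene starts with ATG. `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a displayed DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final displayed row of the gene sequence.
print(translate("ATGAAAATGA AGATGTCCGA AGACGCGAAC"))  # MKMKMSEDAN
print(translate("GAACGCCTCC ACGAAGGAGC CCTCTGA"))     # ERLHEGAL
```

Both fragments agree with the start and end of the protein sequence shown above.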