Gene Caul_3774 details


Gene Information

Locus tag: Caul_3774
Symbol:
ID: 5901236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4090480
End bp: 4091766
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 68%
IMG OID: 641564297
Product: major facilitator transporter
Protein accession: YP_001685399
Protein GI: 167647736
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
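
The start, end, and length fields above agree with each other under the usual coordinate convention. A minimal Python sketch of that check, assuming 1-based inclusive coordinates and a final stop codon that is not counted in the protein length (the variable names are illustrative and not part of the record):

```python
# Values copied from the record above.
start_bp = 4090480
end_bp = 4091766

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1287 428
```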


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.108427
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGGCT GGTACCGCGA GGCGGACCCC AAGACGCGCC GCGTCTTCTG GACCTGCGGC 
GCCGGCTGGG CGATGGACAC CGCCGACGGG CTGGTTTTCC AGTACTTGAT CCCGGTCCTG
ATGGTCGCGT TCGGCATGAC GCTGGCCCAG GCGGGCTATA TCGCCAGCGC CAACTACGTG
GCCGCCGCGG TCGGGGGCTG GCTGGGCGGC TGGCTGAGCG ACCGCTTTGG CCGCGCGCGG
ATCCTGCAGC TGACGATCCT GTGGTTCTCG GTCTTCTCGT TCCTGTCGGG CTTTGCTCAG
ACCTACGAGC AGCTGCTCGC GGCGCGGGTG CTGCAAGGCA TCGGGTTTGG CGCGGAATGG
GCGGTCGGCG CCGTGCTGCT GGGTGAGATG ATCGCGCCGA AGCATCGCGG CAAGGCGCTT
GGCGTGGTGC ACAGCGGCGC GGCGATCGGG TCCGGCATCG CGGCCTTGCT GGCGGGTCCG
TTCGCGGCGG CGTTCCCGAG CGACATCGGC TGGCGCGCGG TGTTCTGGAT CGGTCTCCTG
CCCGCCATAC TGGTGTTCTT CGTTCGCCGG GGTTCGGACG ACCCCGAGAT CTATCGGGCC
GCGGCGCGGC GCGCGGCCGA GACCGGCAAC AGGCCGAAGA TCGCCGACAT CTTCGGTCGA
CGGGTGGTGC GCACCACCAT ACTGGCGTCA TTGCTCTCGC TGGGCACCCA GGGCGCGGCG
TTCGCGATCA GCAACTATCT TACGTCCTTC CTGACGATCG AGCGCCACAT GACCGTCTCG
ATGGCCGGAA TGTGCGTGCT GTTCAACAGC CTGGGCGGGT TCTTCGGCTT CCTGGTCAAC
GCCTACATCT CCGACCATGT CGGCCGCCGG GGCGCCTTTC GTCTGTTCGG GGCCGGCTTC
ATCCTGACCG CGTCGGTCTA TCTGTTCGCG CCTCTGGGCA ACTCGCCCGC CATCCTGATC
CCGGCCGGCC TGATCTACGG TTTCTTCCAG TTCGGGATCT ACGCCTCGTT CGGACCCTAC
TTCACCGAGC TGTTCCCGAC CGAGGTGCGC GCCACCGGAC AGGCCTTCGC CTATAATTTC
GGTCGCGGGG GCGCCGCGCT GTTCATCACC GGAGTCGCCC TGCTGGCGGG GACGCTGCCG
CTGAGCGCGG CGATGGCCGC CGTGGCGATC ACCGGCATGG CGCTCTCGAT CGCGGCGACC
CTGGCCCTGC CGGAGACTGC GGGGCGCGCG CTGCATAGTC TTGGCGACAT AGACGCCCGT
GAACTGGCCG GCGTTCCGCC CGACTGA
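
The gene sequence above is printed in space-separated blocks of 10 bases, so the blocks need to be joined before any per-base statistic such as the GC content field is computed. A small Python sketch, assuming GC content means the fraction of G and C bases over the whole sequence (`gc_content` is a hypothetical helper, not something the record provides):

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Join the 10-base blocks and normalise case.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGATGGGCT GGTACCGCGA"))
```

Applied to the full 1287 bp sequence, the result can be compared against the GC content reported in the Gene Information section.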
 
Protein sequence
MMGWYREADP KTRRVFWTCG AGWAMDTADG LVFQYLIPVL MVAFGMTLAQ AGYIASANYV 
AAAVGGWLGG WLSDRFGRAR ILQLTILWFS VFSFLSGFAQ TYEQLLAARV LQGIGFGAEW
AVGAVLLGEM IAPKHRGKAL GVVHSGAAIG SGIAALLAGP FAAAFPSDIG WRAVFWIGLL
PAILVFFVRR GSDDPEIYRA AARRAAETGN RPKIADIFGR RVVRTTILAS LLSLGTQGAA
FAISNYLTSF LTIERHMTVS MAGMCVLFNS LGGFFGFLVN AYISDHVGRR GAFRLFGAGF
ILTASVYLFA PLGNSPAILI PAGLIYGFFQ FGIYASFGPY FTELFPTEVR ATGQAFAYNF
GRGGAALFIT GVALLAGTLP LSAAMAAVAI TGMALSIAAT LALPETAGRA LHSLGDIDAR
ELAGVPPD
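
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this in plain Python; it assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in their permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and the function expects only A, C, G, and T in its input.

```python
from itertools import product

# Standard codon assignments, with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # join the 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGATGGGCT GGTACCGCGA GGCGGACCCC"))  # prints: MMGWYREADP
```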