Gene Caul_5135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5135 
Symbol 
ID5897345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp54795 
End bp55994 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content69% 
IMG OID641555238 
Productmajor facilitator transporter 
Protein accessionYP_001676569 
Protein GI167621784 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.774559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACCC GTCCGTCCCC GGCGCGAGAC GACGCCGCCC CCCGTCTGGC CAAACCGGTC 
GAACGCTGGG CCGCCGTCTT CTCGGTCGCC GTCGCCTCCT TCGCGCTGGT GACCACCGAG
TTTCTGCCGG TGGGCCTGCT CAGCGCGATC GCCCAGGATC TGGGCGTCAG CGTCGGGGCG
GCCGGGTTGA TGATCAGCCT GCCTGGCGCG ACGGCCGCCC TGGCCGCACC GATCCTGACG
GTGCTTAGCC GCACCCTGGA TCGGCGTGTC CTGCTGCTGG CGATGACCGC CTGCTTGATT
GGTGCGGACG TCGTCTCCGC CCTCACCCCC AACTTTGCCT TGATGCTGGC GGCTCGGGCG
GTGCTGGGGG TGGCGATCGG CGGCTTTTGG GCGGTCGGCG CGGCGGTGGG CGGGCGCCTG
GTGGGCGAGA CGCAAGCGGG CCGCGCCACC GCGATTATCT TTTCGGGGAT CTCTCTGGGC
GCCCTGCTCG GCGTGCCGGT GGGCGTCTTC CTGGGGGCGC TTTCCGGTTG GCGCACGGCC
TTTTGGGCGG CCGGAGGGCT CTCTCTGGTC ATCCTGATCG CCCAGGCGGT CTGGCTGCCC
AAGCTGCCGG GCCTGCGCGC GGTGCAGGTC AAGGACCTGT TCGGGATCTT CGCCAATCGC
AACGCCCGCG TGGGCCTGCT GGCAGTGTTT TTCGCGGTGG CCGGCCAGTT TGCGGCCTAT
ACCTTCGTTA ATCCCGTTCT CCTGGACGTT ACGCGCCTGA CGCCCACGGC GCTGAGTCAG
GTGTTCTTCG CCTACGGGGT GGCGGGCTTT TTCGGCAACT TCCTGGGCGG GCATGGCGCG
GGCAAGAACG TTCGCGCCGC CAAGTTCGTG GTGTTGCTGG CCTTGGGCGG CGTCATCATC
GCCTTTGCCC AGTTGGCGGC CCACCCGCCG GCGGCGATCT TGCTGCTGAC GGCCTGGGGA
CTGGTCTGGG GCGGCCTTCC GATCTTGCAC CAGGCCTGGG CTATGCGCGC GTCGCCGGGC
ATGGCCGAAG GCGGATCGGC CCTCTTCGTT TCAGTGTTCC AAGGCTCGAT CGCCATCGGC
TCGGGCTTGG GCGGCGCGGC CGTGGAGACG GTGGGCCTGG TGTGGGGGTT GACCCTGGGC
GGGCTGTCGA TCCTGATCGC GCTCGTGGTT TCGGTGCTCG GGTCCAAGCC TTTCCGCTAG
 
Protein sequence
MMTRPSPARD DAAPRLAKPV ERWAAVFSVA VASFALVTTE FLPVGLLSAI AQDLGVSVGA 
AGLMISLPGA TAALAAPILT VLSRTLDRRV LLLAMTACLI GADVVSALTP NFALMLAARA
VLGVAIGGFW AVGAAVGGRL VGETQAGRAT AIIFSGISLG ALLGVPVGVF LGALSGWRTA
FWAAGGLSLV ILIAQAVWLP KLPGLRAVQV KDLFGIFANR NARVGLLAVF FAVAGQFAAY
TFVNPVLLDV TRLTPTALSQ VFFAYGVAGF FGNFLGGHGA GKNVRAAKFV VLLALGGVII
AFAQLAAHPP AAILLLTAWG LVWGGLPILH QAWAMRASPG MAEGGSALFV SVFQGSIAIG
SGLGGAAVET VGLVWGLTLG GLSILIALVV SVLGSKPFR