Gene Caul_1930 details

Gene Information

Locus tag: Caul_1930
Symbol:
ID: 5899385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2069979
End bp: 2071334
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 68%
IMG OID: 641562420
Product: major facilitator transporter
Protein accession: YP_001683557
Protein GI: 167645894
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
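
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. Neither assumption is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2069979
end_bp = 2071334
gene_length_bp = 1356
protein_length_aa = 451

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under those assumptions the three fields agree with one another: the span equals the gene length, and the codon count minus the stop codon equals the protein length.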


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.604582
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGGAC AGGTCCCAAC GACACCGCCG GAGGCGGGTT ATCCGCCCAT TCGCTATGCC 
TGGTATGTCA TCGGGGTCCT GTTCGTGGCG ACCCTGCTTT CGCAGTTGGA TCGCCAGTTG
CCCGCGCTGC TGGTGCGGCC GATTCGGGCC GAGTTCGGCA TTTCCGACAC CGCCTTCAGC
TTCCTTCAAG GTTACGCTTT CGCGCTGTTC TATACCTTCG CGGGGCTTCC GTTCGGCTGG
CTGATCGATC GCACCGTTCG CCGTAACCTG ATCATCATCG GCATGGTCCT GTGGAGCGTG
ATGACGGTCC TGTCGGGCTT CGCGCAGAAC TACAACGCGC TCGTCCTGAC CCGGATGGGC
GTCGGCATCG GCGAGGCGGT TCTGGCGCCC GCCGCCTATT CGATGATCGC CGACTATGTC
TCGCCCCAAC GACGGGGAAG GGCGTTCAGC GTCTACTATC TGGCCCTGGC CATCGGGTCC
GGCGCGTCCC TGATCCTGGG GGCTCTGATC GCTCGCGTCA TCCCCGACGC CGGCCTCAGC
CTGCCGGGCG TGGGGCTCAT GTCGCCCTGG CGGCTGACCT TCCTGATCGC GGGCGCGCCC
GGGCTGCTGC TGGCGCTGCT GTTGTTCACG ATCCGCGAGC CCGTGCGTCG TGACGCCGCC
GTCCTGGCGT CGCAAGCGGG CGCCGGGTGG AGAGACTTTC TCGGTTACCT GAAGCGTCAC
TTCGCCACCT TCTCGCGCGT CCTGACCTAT CCGGGTGTCG TCGCCGTCGT CGGCTATGGC
ACGCTGGCCT GGGCCCCGGC CTTCTTCGAC CGCCGATTCG ACATTCCGCC CAAGACCTCG
GGCCTGATCA TCGGCGTGCT GGTGGCGGGC GGCGGGCTGG TTGGCACCCT GATCAGCGGC
TGGCTCAGCG ATCGCTGGAC CGCCAAGGGC GATCCGGCCG CCCGGTTGCG GGTGGCCATG
CTGGCGTGGC TGCTGGTGCT GCCGACGGTC TGCGCGTGGT CGCTGGTGGG CGTTCCGTGG
CTGAGCTTCG CCCTGCTGAC CGTGGTCATC ACGGGCTTTG GCATGGCCCA GGCGGCGGCG
CCGACGGCGG TCCAGGAAAT CACCCCAAAC CGCATGCGCG GCAAGGCGGT GGCCGTCTAT
CTGTTGATCG GCGGCCTGGT CGGCATTGGC TTTGGCCCCA TGTCGATCGC CCTGGTGACC
GACCATGTGT TTAAGAGCGA CGCCGGCCTG CCCTATGCCC TGGCCCTCGT CGGAGGGCCG
ATGTCCCTGC TCGGCCTGTG GCTGACCTGG TCGGGCCTGA AGCCTTACGG GCGCACGGTC
GAGGCCTTGA AGGCCGAGGC CGCGCGAACG TCCTGA
 
Protein sequence
MHGQVPTTPP EAGYPPIRYA WYVIGVLFVA TLLSQLDRQL PALLVRPIRA EFGISDTAFS 
FLQGYAFALF YTFAGLPFGW LIDRTVRRNL IIIGMVLWSV MTVLSGFAQN YNALVLTRMG
VGIGEAVLAP AAYSMIADYV SPQRRGRAFS VYYLALAIGS GASLILGALI ARVIPDAGLS
LPGVGLMSPW RLTFLIAGAP GLLLALLLFT IREPVRRDAA VLASQAGAGW RDFLGYLKRH
FATFSRVLTY PGVVAVVGYG TLAWAPAFFD RRFDIPPKTS GLIIGVLVAG GGLVGTLISG
WLSDRWTAKG DPAARLRVAM LAWLLVLPTV CAWSLVGVPW LSFALLTVVI TGFGMAQAAA
PTAVQEITPN RMRGKAVAVY LLIGGLVGIG FGPMSIALVT DHVFKSDAGL PYALALVGGP
MSLLGLWLTW SGLKPYGRTV EALKAEAART S
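
The sequences above are printed in space-separated blocks of ten characters, which have to be joined before the sequence is used elsewhere. The following is a minimal sketch of one way to do that and to estimate the GC fraction of the result. The helper names `clean_sequence` and `gc_fraction` are hypothetical, not part of any database tool, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no argument drops spaces and newlines alike.
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First line of the gene sequence in this record, used as sample input.
first_line = "ATGCACGGAC AGGTCCCAAC GACACCGCCG GAGGCGGGTT ATCCGCCCAT TCGCTATGCC"
seq = clean_sequence(first_line)

print(len(seq))
print(round(gc_fraction(seq), 2))
```

Running the same two helpers over the full gene sequence should give a value that can be compared with the GC content field of the record.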