Gene Caul_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1938 
Symbol 
ID5899393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2078103 
End bp2079434 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content68% 
IMG OID641562428 
Productmajor facilitator transporter 
Protein accessionYP_001683565 
Protein GI167645902 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.829808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.641646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAGA CGCTGACCGA CAAACTCGCG AACGCCAAGG ACTCCCGCTA TCGCTGGCTG 
GTCCTGGCGG TGCTGACCGC CGTGCATTCG ACCCACCACA TCGACCGCAA CGTCCTGTCG
GTCGTCGTCG AGCCGATCCG GCAGGAGTTT CATCTCAGCG ACAGCCAGAT GGGAATGCTG
GGCAGCCTGG GCTACGCGCT GGCCTTCGCC ATCGCCGCGA TACCCATGGG GTATCTGGTC
GACCGGGTGA ACCGCCGTAA CATGCTGGTC GGCATCCTGG CGCTGTGGAG CGTGATGACG
GCGGTCTGCG CCTCGGCCAA CAGCTACGTG CACCTGTTGC TGGCCCGGAT GGGCGTCGGC
ATCGCCGAGT CCGGCGGCGC CCCGACCGCC ATGTCGATGG TCTCTGACTA TTTCCCGCCC
AAGCAGCGGT CGACGGCGAT CGGCATCTGG TACCTGAGCT CGGCGATCGG CACCGGGATC
ATCTTCCTGG TCGGTGGCTT CCTGGCCCAG TCGTTCGGCT GGCGCACGGT GTTCCTGGTG
GCGGGCGTAC CCGGCCTGGT GATGGGTCTG ATCCTGTTCT TGGTCGTGCG CGAACCCCCG
CGCGGCGGAT CGGAGGTCGT GGCCCTCGAT ACGCCGGAAA CCACGCCCGC CGCGACCGTC
GACACCCCGG AAAAAGCCGC CACCCCGCGC GAGGCCTTCG CCTACGTGAT CCGCCGCCCG
GCCATTCTGA GCATGATGGC CGGCATCGTC CTGGCCGCCG CGATGAGCTC GGCCTTCGCC
CTGTGGTCGG TGTCGTTCCT CGTGCGGGTT CACCACATGC CGCTGGCCCT CGCCGGCGTA
TCGATCGCCG CGGCCTTCTC GGTGTTCGGC ATCATCATTC CGTTGATTTC CGGCGTGATG
GGCGACCGGC TGTCGAACGC GAAGGACGGT CACAGGCCCG AGCGCCTGGC CCTGCTCAGC
GCCACGACCA TGACCGGCGT GGTCCTCTGC GGCGTCGCGG CCGCCTTGTC CGGCAGCGCG
CCCGTCGCGG TGGCGATGAT GTGCCTGTGG TGCGGTCTGA TGCTGGCCCA CAACGGACCG
GCCAACGCCC TGGTCCTCAG CCTGCTTCGC CCCCGGATGC GGGGGGTCGT CGTCGCCACG
CTGCAGACCG TCGCGACGGT GGTCGGCACG GCGCTGGGCC CCTTTCTGGT GGGCGTGCTC
AGCGACGTCT ATGGCGGCCC CAACTCGCTG CGGTGGGCCA TCATGACCGG CATGTCGCTG
AACGTCGTGG CGGTGCTGTG CTTCCTCAAC GCGGCTAGGA CCGCCCGCCG GGATTCCCTG
CTGGACGGCT AG
 
Protein sequence
MPKTLTDKLA NAKDSRYRWL VLAVLTAVHS THHIDRNVLS VVVEPIRQEF HLSDSQMGML 
GSLGYALAFA IAAIPMGYLV DRVNRRNMLV GILALWSVMT AVCASANSYV HLLLARMGVG
IAESGGAPTA MSMVSDYFPP KQRSTAIGIW YLSSAIGTGI IFLVGGFLAQ SFGWRTVFLV
AGVPGLVMGL ILFLVVREPP RGGSEVVALD TPETTPAATV DTPEKAATPR EAFAYVIRRP
AILSMMAGIV LAAAMSSAFA LWSVSFLVRV HHMPLALAGV SIAAAFSVFG IIIPLISGVM
GDRLSNAKDG HRPERLALLS ATTMTGVVLC GVAAALSGSA PVAVAMMCLW CGLMLAHNGP
ANALVLSLLR PRMRGVVVAT LQTVATVVGT ALGPFLVGVL SDVYGGPNSL RWAIMTGMSL
NVVAVLCFLN AARTARRDSL LDG