Gene Caul_0427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0427 
Symbol 
ID5897701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp467441 
End bp468466 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content65% 
IMG OID641560913 
Productxylose isomerase domain-containing protein 
Protein accessionYP_001682062 
Protein GI167644399 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1082] Sugar phosphate isomerases/epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGC TCAAAGGCCC GGCGATCTTT CTGGCCCAGT TCGCCGGCGA CGCCGCGCCG 
TTTGACACGC TTGAGAACCT GGCCGCGTGG GCCGCCGGGC TTGGATACAA GGGTGTTCAG
GTCCCCACCG ACAATCCGGC GATCTTCGAT CTGACCCTGG CTGGCCAAAG CAAGACCTAT
TGCGATGAGG TCAAGGGCCG GCTGGCGCAG ATCGGCGTGG AGATCACCGA GCTGTCCACC
CACCTTCAGG GACAGCTGGT GGCGGTGCAT CCGGCCTATG ACGAGCTGTT CGACGGTTTC
GCCGCGCCGC AGGTGCGCGG CAAGCCGGTC GAGCGCCAGG CCTGGGCCGT CGAGCAATTG
AAGTCCGCCG CCCGGGCCAG CGCCCACCTT GGGCTTTCGG CCCACGCCAC CTTCTCCGGC
GCCCTGGCTT GGCACCTGGT CTATCCCTGG CCGCAGCGGC CGCCCGGCCT GATCGAAGCG
GCCTTCGAGG AGTTGGCGCG ACGCTGGCGA CCGATCTTGG ACGCCTTCGA TGAGGCCGGC
GTGGACGTCG CCTACGAGAT CCACCCGGGA GAGGACTTGC ACGACGGGGC GACCTTCGAG
CGGTTCTTGG CGGCGGTCGA TGACCACCCG CGCGCCAATA TCCTGTTTGA TCCCAGCCAC
TTCGTTCTGC AGCAGCTGGA CTATCTCGAT TTCATCGACC GCTATCACCC GCGCATCAAG
GCGTTCCACG CCAAGGACGC GGAGTTTCGG CCCAATGGTC GCAACGGCGT CTATGGCGGC
TACCAAAACT GGATCGACCG CGCCGGCCGC TTCCGCTCCT TGGGCGATGG CCAGGTCGAT
TTCAAATCCA TCTTCAGCAA GCTGGCTCAG TACGACTTTG ACGGCTGGGC GGTGCTGGAG
TGGGAGTGCT GCCTCAAACA TCCCGAGGAC GGCGCCCGAG AAGGCGCGGC CTTCATCCGC
GACCACATCA TCCGCGTGAC CGACCGAGCC TTCGACGATT TCGCCAAGGT CGTTCCGACC
CGCTGA
 
Protein sequence
MKTLKGPAIF LAQFAGDAAP FDTLENLAAW AAGLGYKGVQ VPTDNPAIFD LTLAGQSKTY 
CDEVKGRLAQ IGVEITELST HLQGQLVAVH PAYDELFDGF AAPQVRGKPV ERQAWAVEQL
KSAARASAHL GLSAHATFSG ALAWHLVYPW PQRPPGLIEA AFEELARRWR PILDAFDEAG
VDVAYEIHPG EDLHDGATFE RFLAAVDDHP RANILFDPSH FVLQQLDYLD FIDRYHPRIK
AFHAKDAEFR PNGRNGVYGG YQNWIDRAGR FRSLGDGQVD FKSIFSKLAQ YDFDGWAVLE
WECCLKHPED GAREGAAFIR DHIIRVTDRA FDDFAKVVPT R