Gene Caul_2131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2131 
Symbol 
ID5899586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2298558 
End bp2299727 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content69% 
IMG OID641562620 
Productmannose-6-phosphate isomerase 
Protein accessionYP_001683757 
Protein GI167646094 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0831943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.49675 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTCGC GGCCTGATCC TTCGGCCCGC GCGAGCGCCG GCTACGCCCG CCTGCGGAAC 
TGGCTGGTGG AGGCCGCCGT TCCGCTCTGG AGCCAGGCGG GCGTCGAGGC GTCCGGCGCC
TTTCACGAGA CTCTCGACCC GCGGGGCCCG CCCATCGACG GACCGCGTCG CGCCAGGGTC
CAGCCGCGCC AGCTCTACGC GCTGCAGACC GCGCGCAAGC TGGGCGCCAG CCTTGATGAC
GCGATACTGC GTCAGGGTCT TCACGCGTTC ACGACCCGAT ATGTCCGGTC CGACGGACTG
ATCCGGACCC TCGTCGCGGG CGACGGACGG GTTCTCGACG AGACCGCGGT CCTCTACGAT
CAGGCCTTCG CGCTCCTGGC CCTGGCGAGC TTGGCGCCGT TATTGGGTCA GGAGGGCGAG
CGAAGCGCGA TGGTCCTCCT GCGGGCGATC GGCGTCTATT TCGGACGTCC GGGCGGGTTC
GAGACGACGC TCCCGGCTTC TCTGCCGCTG GCGTCCAATC CGCATATGCA TCTGCTGGAG
GCCTGCCTGG CGTGGATGGC CGCGGGCGGT GATCCGGCCT GGGCCCAGAC GGCCGGCCAG
ATCGTCACCT TGACCCTGGA TCGCTTCATC GATCCGGTTA ATGGAGCCTT GCGGGAGTTC
TTCGACGGCG ACTGGCGCCC GGCGGCGGGA GAGCCGGGAC GGATCGTCGA GCCTGGGCAT
CAGTTCGAAT GGGCGTGGCT GCTGCTGAGC TGGAGCCGTC TGGCCAAGAC CGATCCTCTG
AGCGTCCCCG CGCGGGTGGC GGCCCTGCGC CTGATCGCCA ACGCCGAGGA CTTCGGCGTC
GATCCGTCGC GCAACGTCGC GATGAACAGC CTGCTCGACG ATTTCTCGAT CCATGACGCG
AAGGCCAGGC TCTGGCCGCA GACCGAGCGG ATCAAGGCTT GGGCCTTGGC GAGCACGCAA
CTGGACGCGC CGTGGTGGGA CACCGTCGCC GACGCCGTCG AAGGCCTGGA GCTCTATCTG
CGCACGCCGG TCCAGGGCCT GTGGTTCGAC AACATGACCG CCGATGGCGA TCTCGTCGAC
GGTCCAGCGC CGGCCAGTTC CTTCTATCAT ATCGTTTGCG CGATCGAGGT GCTCGGCCGG
GCCCTGGGAG ATGTCCGTTG CGCGGCATGA
 
Protein sequence
MKSRPDPSAR ASAGYARLRN WLVEAAVPLW SQAGVEASGA FHETLDPRGP PIDGPRRARV 
QPRQLYALQT ARKLGASLDD AILRQGLHAF TTRYVRSDGL IRTLVAGDGR VLDETAVLYD
QAFALLALAS LAPLLGQEGE RSAMVLLRAI GVYFGRPGGF ETTLPASLPL ASNPHMHLLE
ACLAWMAAGG DPAWAQTAGQ IVTLTLDRFI DPVNGALREF FDGDWRPAAG EPGRIVEPGH
QFEWAWLLLS WSRLAKTDPL SVPARVAALR LIANAEDFGV DPSRNVAMNS LLDDFSIHDA
KARLWPQTER IKAWALASTQ LDAPWWDTVA DAVEGLELYL RTPVQGLWFD NMTADGDLVD
GPAPASSFYH IVCAIEVLGR ALGDVRCAA