Gene Caul_4916 details


Gene Information

Locus tag: Caul_4916
Symbol:
ID: 5902378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5311456
End bp: 5312616
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 68%
IMG OID: 641565436
Product: mannose-6-phosphate isomerase
Protein accession: YP_001686534
Protein GI: 167648871
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2942] N-acyl-D-glucosamine 2-epimerase
TIGRFAM ID:
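
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record's own numbers are consistent with both assumptions. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5311456, 5312616)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both values agree with the Gene Length and Protein Length fields listed in the record.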


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.982675
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCCGCCC AAGATGCCCT CCTCGCCGTA CCGTTCGACG AAATCCGCCG CTGGATGTTC 
GACGAGGCTC TGCCCTTCTG GGCTGAATAC GGGGTCGACC GCGCGGACGG GGGGTTCGTC
GAACAGCTCG ACTTCGAAGG CCGCGATGTG GGCGTGGACT TCAAGCGCAC CCGCGTCACG
TGCCGGCAGA TCTACGTCTT CTCCCACGCC GCGGTCCTCG GCTGGGAGCC TGGCAGGGCG
CTCGCCGACC ACGGTGTGGC GTTCCTCAAG AAAGCGTGGT TGGGGCCCGA CGCCGGCTGG
GCCCGGTCGC TGACTCGCCA GGGTGACGTC CTGGACGCCA CGCCCGATCT CTACGACATC
GCCTTCGCCC TCTTCGCCCT GGGGTGGCAC GTTCGCGCCA CCGGTGACGC CGACTCCGCC
GAACTGGCTC GGCAGACCCT CGACTTCACC GAACGCCACA TGCGCCCTGC CCAGGGTCGC
GGCTTCCTGC ACGAAAAGCC GGCCAAGGGC TGGCGTCTGC AAAATCCCCA CATGCACCTG
ATGGAAGCCG CGCTGTCCTG CCTCGAGGCC ACCGGCGACC CGCGCTACGC CGAACTGGCC
AAGGAGCTGG AGGGCCTTTT CCGCGACAAG CTGTTCGTCC CCCAAAGCCA GACCCTGGCT
GAGTATTTCG ACGACGACTG GAATCGCGCG CCCAGCGACG ACGGCCGGAT CATCGAGCCC
GGCCACCAGC TGGAATGGGC CTGGATCCTG GCCAATCTCG AGCGCCTGAC CGGCGCCAAG
ACCGAGGACC TGGTGCGCGG CCTAACGGAC TTCGCCGAAC GCCACGGCGT GGATCCCGAG
ACCGGCGTGA CCTACAATCA GGTGCGAGAC GACGGCGTCG CGCTGGATCG CGGATCGCGC
ACCTGGCCCA ACACCGAGCG GCTGAAGGGC CACGTCGCGC GCTTCGAGCA ATGGGGCGAA
GACCCGCGCC GGGCCTTGAC CAGCTCAAGC CGCGTGCTGC TTGACCGCTA TCTGGGCTAC
GGGCTTCCCG CCCTGTGGCT GGACCATTTC GGGCCCGACG GCGAGCATCG CGTGAACTAC
GCGCCAGCCT CGACGCTGTA TCACGTGTTT CTGGCGTTCG CGGAAGTTCT GCGAATCGAG
CCGCGCTTGG CCGGGATCTA G
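
The GC content field of a record like this one can be recomputed from the gene sequence. The following is a minimal sketch that assumes the sequence is pasted in the same space-separated blocks of ten bases used above; `gc_fraction` is a hypothetical helper name, and the database's own rounding rule is not known.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Usage on the first block of the gene sequence (10 bases, 8 of them G or C).
print(gc_fraction("TTGCCCGCCC"))  # 0.8
```

Applying the same function to the full gene sequence gives a value that can be compared with the GC content reported in the record.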
 
Protein sequence
MPAQDALLAV PFDEIRRWMF DEALPFWAEY GVDRADGGFV EQLDFEGRDV GVDFKRTRVT 
CRQIYVFSHA AVLGWEPGRA LADHGVAFLK KAWLGPDAGW ARSLTRQGDV LDATPDLYDI
AFALFALGWH VRATGDADSA ELARQTLDFT ERHMRPAQGR GFLHEKPAKG WRLQNPHMHL
MEAALSCLEA TGDPRYAELA KELEGLFRDK LFVPQSQTLA EYFDDDWNRA PSDDGRIIEP
GHQLEWAWIL ANLERLTGAK TEDLVRGLTD FAERHGVDPE TGVTYNQVRD DGVALDRGSR
TWPNTERLKG HVARFEQWGE DPRRALTSSS RVLLDRYLGY GLPALWLDHF GPDGEHRVNY
APASTLYHVF LAFAEVLRIE PRLAGI
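
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes NCBI translation table 11, whose codon-to-amino-acid assignments match the standard code but which accepts alternative start codons such as TTG; the initiator codon is translated as M regardless of its usual meaning. `translate_cds` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # initiator codon is always read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("TTGCCCGCCC AAGATGCCCT CCTCGCCGTA"))  # MPAQDALLAV
```

The TTG start codon explains why the gene sequence does not begin with ATG even though the protein begins with M.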