Gene Caul_0318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0318 
Symbol 
ID5897592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp359940 
End bp361103 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content66% 
IMG OID641560802 
Productsugar isomerase (SIS) 
Protein accessionYP_001681953 
Protein GI167644290 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2222] Predicted phosphosugar isomerases 
TIGRFAM ID[TIGR02815] putative sugar isomerase, AgaS family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.500662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGACA TCACCTATCT GGGCCTCGAA GAGTCCGAGC TGGACCGCCT GGGCGGGCTG 
TGGACCGCGC GCGAGATCGC CCAGCAGCCG GCCATGCTGC GCGAGACGCA AGCGCTTCTG
ATGGCGCGGC GCGGCGAGAT CGAGGCCTTC CTCAAGCCGC TTCTGGCTTC GCCCGGTCTG
CGCGTCATCC TGACTGGCGC CGGGACCTCG GCCTTTATCG GCGAATGCCT GGCGCCGGTC
CTGTCCACGC GCCTGGGCCG CCGTGTCGAG GCGATCCCGA CGACGGATCT GGTCTGCGCC
CCGCATCTCT ATTTCGAAAC CGATACGCCG ACGCTGCTCG TCTCCTTCGG ACGTTCGGGC
AACAGCCCGG AAAGCGTCGC GGCCGTCGAG CTGGCCGACC GTCTCGTCAA GACCCTGCAT
CACCTCGTCA TCACCTGTAA CGCCGAAGGC GCCCTGGCGG CCTATGCGCG CGGATCGCGC
GGCCTGACCG TGCAGCTTCC AGAAGCCACG CACGATCGCG CCTTCGCCAT GACGTCCAGC
TTCTCGTGCA TGACCTACGC GGCGCTGGCG GTCTTCAGCG GTATCGCGAC CATGGACGAG
CGTATCGACT CCATCGCCCG CGCCACCCAG AGCGTGATCG CCACCTATAC CTCGGTCATG
CGCGCCGCGG CCGCCGAAGG CTATGAACGC GTCGTCTACC TCGGCAGCCA TATCTTCCAG
GGACTGGCCC GCGAGTCCGG GCTCAAGCTT CTGGAGATGA CCAACGGCCA GCTGGTGACC
ATGTTCGATT CGCCTCTGGG CTTCCGCCAC GGTCCAAAGA CCATCGTCAA CGACCGCACC
CTGATCGTCG TCTTCTTCTC CAACAACGCC TATACGCGCA GCTACGATGT CGATCTTCTG
GACGAGTTGC GCCGCGACAA CGACGCCGCC CGCGTCATCG CCTTGACGGC GCAGGACGGG
GTAGGATTGC CAAGGCGCGA CGAGCTCAGC GTTCCAGGCC TGGCGATGGC CGACGACGCC
GAGCTGCTCT TTCCCTATAT CGTGGCGCCG CAGATCTTCG CCTTCTTCGA GTCCCTGCGC
CTGGGGCTGA CGCCGGACAA GCCCAACACC TCGGGCACGG TCAACCGCGT CGTGCAGGGC
GTGCGCATCC ACGAGCTGAG CTGA
 
Protein sequence
MSDITYLGLE ESELDRLGGL WTAREIAQQP AMLRETQALL MARRGEIEAF LKPLLASPGL 
RVILTGAGTS AFIGECLAPV LSTRLGRRVE AIPTTDLVCA PHLYFETDTP TLLVSFGRSG
NSPESVAAVE LADRLVKTLH HLVITCNAEG ALAAYARGSR GLTVQLPEAT HDRAFAMTSS
FSCMTYAALA VFSGIATMDE RIDSIARATQ SVIATYTSVM RAAAAEGYER VVYLGSHIFQ
GLARESGLKL LEMTNGQLVT MFDSPLGFRH GPKTIVNDRT LIVVFFSNNA YTRSYDVDLL
DELRRDNDAA RVIALTAQDG VGLPRRDELS VPGLAMADDA ELLFPYIVAP QIFAFFESLR
LGLTPDKPNT SGTVNRVVQG VRIHELS