Gene Caul_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0422 
Symbol 
ID5897696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp461924 
End bp462925 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content61% 
IMG OID641560908 
Productxylose isomerase domain-containing protein 
Protein accessionYP_001682057 
Protein GI167644394 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1082] Sugar phosphate isomerases/epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACCA AAAAGGGCCT CAAGCTCGGC ACGACGCTCT ACAGCCTCAC CAACGAGTTT 
CATTGGCGAA AATATGATTT CGAAGGCCTG GTTCGCAGGG TCGCGGCCGA GAACCTTGGG
CCGGGGCTAG AGGTTGTCGG CTTCCAAAGC ATCAAGGGCT TCCCGGTCAT CACCGACGCC
TACGCCGAAT GGTTCAAGGC GCTGATCGCC GAGACCGGGC TTGAGCTGTC ATGCCTGGGC
ATCAACGCTG ACAACGCCAT TCGTCGCGAC CGCGACATGA CGGTCGAGGA GTCGGTGACC
TATCATCAGG CTCAGATCGA TGCGGCGGCC AAGCTAGGCT TCCCGGTGGC GCGTTACCAA
TACCCGGCCG GCGTCGAGGT CATCCGCCGC CTAGAGCCCT ATGCGGCCGA AAAAGGCGTC
AAACTCGGGC TCGAAATCCA TTCGCCCCAC ACCGTCCACA CGCCCGACAT CATGAAATAT
CGTGAGCTCT ATGACACGCT GAGCTCGCCC TATCTCGGCT TTGTGCCCGA CTTCGGCTCA
TCGGTGGTGG GCATTCCTCC GATGGTCATC GCCCGTTTCC GTGCGGGCGG CGCGTCCGAG
ACCCTGATCG ACATCGTTCT GGAGGAGTGG CGTAGCGACG CCCCGGTGAT GGAGAAGCAG
GCCAGCTTCC GCAGGCGCGG CGAAGCGGCC GGGGCCAATG TGGAGACCCT GAACCGTCTG
GCCTTTGTCT TCGGCTATTT CAGTCGCCAG GCGCCGCAGG ACTGGGCCGA GATCATGCAC
CAGGTCGTGC ACATCCACGG CAAGTTCTTC GACTTCAATG ACCAGGGCGA AGAGAACTCC
GTGCCCTATC CGGAAATCCT CAAGGTCTTC GTTGACGGCG GCTACGACGG CTACATGTCC
AGCGAGTACG AGGGCCATCT GTTCTCGGAC GACGACGGCT TCGACAAGCT GCTCGCCCAC
CATGCCCAAT GCCAGCGCAT CCTCGATCGG CTGCAAGCCT AG
 
Protein sequence
MSTKKGLKLG TTLYSLTNEF HWRKYDFEGL VRRVAAENLG PGLEVVGFQS IKGFPVITDA 
YAEWFKALIA ETGLELSCLG INADNAIRRD RDMTVEESVT YHQAQIDAAA KLGFPVARYQ
YPAGVEVIRR LEPYAAEKGV KLGLEIHSPH TVHTPDIMKY RELYDTLSSP YLGFVPDFGS
SVVGIPPMVI ARFRAGGASE TLIDIVLEEW RSDAPVMEKQ ASFRRRGEAA GANVETLNRL
AFVFGYFSRQ APQDWAEIMH QVVHIHGKFF DFNDQGEENS VPYPEILKVF VDGGYDGYMS
SEYEGHLFSD DDGFDKLLAH HAQCQRILDR LQA