Gene Caul_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1761 
Symbol 
ID5899216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1859032 
End bp1860138 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content65% 
IMG OID641562251 
Productribokinase-like domain-containing protein 
Protein accessionYP_001683388 
Protein GI167645725 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.550108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACA ACATCCTCGA CATCCGCCCG GCTTCGGAAA CGAAGTGGGA TTGCGCCAGC 
TTCGGTGAAG TGATGCTGCG TTTCGACCCC GGCTTCGGCC GGGTTCGCAA CGCGCGCCAG
TTCAACGTCT GGGAAGGCGG CGGCGAATAC AACGTCGCCC GCGCCTTCAG GAAGTGCTGG
GGCAAGCGCT CCACCGCCGT CACCGCCCTG CCGGTGAACG ATCTGGGCTG GCTGGTCGAG
GATCTGATGA TGCAGGGCGG CGTCGACACC TCCCACATCA TCTGGCGCGA CTTCGACGGC
CTGGGCCGCA ACACCCGGGT TGGCCTCAAC TTCACCGAAA AGGGCTTCGG CGTTCGCCCG
GCCCTGGGCT GCAGCGACCG GGGCCACTCG GCCGCCTCGC AGATCCGTCC CGGCGAAGTG
AACTGGGAAA AGCTGTTTGG CGAGGAGGGC GTGCGCTGGT TCCACACCGG CGGCATCTTC
GCGGCCCTGG CCAGCAACAC TGCAGAGGCC GTGATCGAAG CGGTCGAGGT GGCCCGCAAG
TACAGCACGG TGATCTCCTA CGACCTGAAC TACCGTGCCT CCTTGTGGAA GTCCCAGGGC
GGCAAGGAGG GGGCCCAGAA GGTCAACCGC CACATCGCCC AGTACGTGGA CGTGATGATC
GGCAACGAAG AAGATTTCAC CGCCTGCCTG GGCTTTGAGG TCGAAGGCCT GGACGAGCAC
ATCAGCGCGA TCGATCCGGC CAACTTCAAG AAGATGATCC AGACGGCCGT GAAGCAGTTC
CCGAACTTCA AGGTCGCCGC CACCACCCTG CGCAACGCCA AGACCGCCTC GGTCAACGAC
TGGTCGGCGA TCCTCTACGC CGGCGGCGAG TTCTACGCCT CGATGATGCG CGAGAACCTC
GAGATCTACG ACCGCGTCGG CGGCGGCGAC GGCTTCGCCT CGGGCCTGGC CTTCGGCTTC
ATGGAAGGCA AGGGTCCGCA AGCCGCCGTC GAGTATGGCG CGGCTCACGG CGCCCTGGCC
ATGACCACCC CGGGCGACAC CTCGATGGTG CGCAAGGAAG AGGTCGAGGC CGTGATGAAG
GGCAAGGGCG CGCGGGTCAT CCGCTAG
 
Protein sequence
MTDNILDIRP ASETKWDCAS FGEVMLRFDP GFGRVRNARQ FNVWEGGGEY NVARAFRKCW 
GKRSTAVTAL PVNDLGWLVE DLMMQGGVDT SHIIWRDFDG LGRNTRVGLN FTEKGFGVRP
ALGCSDRGHS AASQIRPGEV NWEKLFGEEG VRWFHTGGIF AALASNTAEA VIEAVEVARK
YSTVISYDLN YRASLWKSQG GKEGAQKVNR HIAQYVDVMI GNEEDFTACL GFEVEGLDEH
ISAIDPANFK KMIQTAVKQF PNFKVAATTL RNAKTASVND WSAILYAGGE FYASMMRENL
EIYDRVGGGD GFASGLAFGF MEGKGPQAAV EYGAAHGALA MTTPGDTSMV RKEEVEAVMK
GKGARVIR