Gene Caul_0988 details

Gene Information

Locus tag: Caul_0988
Symbol:
ID: 5898443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1043549
End bp: 1044670
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 70%
IMG OID: 641561470
Product: hypothetical protein
Protein accession: YP_001682616
Protein GI: 167644953
COG category:
COG ID:
TIGRFAM ID:
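
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Caul_0988.
length = gene_length(1043549, 1044670)
print(length, protein_length(length))  # 1122 373
```

Under these assumptions the computed values agree with the reported gene length (1122 bp) and protein length (373 aa).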


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.4586
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.292703
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGATT TCAAGGGCAT GAAACGCCAG CGCGGCCGCA ACAATCGCGG CGGAGCGGGT 
AGTGGCGGCA AGCCTCAGCA GCACAACGCC AACCGGGCCT TCGATTCGAA CGGCCCCGAA
GGCGTGAAGG TGCGCGGCGC GGCCCAAAGC GTCTATGAAA AGTACCAGCA GCTGGCCCGC
GACGCGACGT CGTCCGGCGA CCGGGTTCTG GCCGAGAACT ACCTGCAGCA CGCCGAGCAC
TATTTCCGGG TGCTGCGCGC CATCCAGCCG AATCGCCCCG TCAGCGACAT CATCGGCAAG
GACGCCTATT CGGCCTACGA GATCGATTTC GAGGCCGAGC CGGAAGAGCA GACCGAAGCG
CCAGAAGCCG CCCAGTCCGA GACTCAGGGC GATGGCGACG GCGGTGATCA GGGTCAGGGC
GAAGGCCGCC GCGACCGGTT CGAGAACCGT CCCCGCGACG ACCGCCCCCG GGAAGACCGG
CAGCGCGATG ACCGTCCCCG TGACGGCCAG CGCGACGATC GTCCTCGCGA AAACCGCGAC
CGCTTCGAGA ACCAAGGTCA AGGTCAAGGT CAAGGTCAGG GTCAAGGCCG CCGGGATCGT
TGGCGCGACC GTGACGACCG TCCCCGTGAT GGCCAGCGTG ATGATCGTCC CCGTGATGAT
CGTCCGCGCG AAGATCGCCC GCGCGACGAC CGTCCCCGCG AAGATCGTCC GCGTGACGAC
CGTCCGCGCG AAGATCGCTT CCGTGACGAA CGTCCTCGTG ACGACCGCCC CCGTGAAGAC
CGGCCGGCCG TCGTCGAGGC GGCCGTCGAG GCTCCCGTCG AAGCCCGCCG CGAGCGTCCG
CGCCGCGAAC GGGCTCCGCG CGACCGCGAT CCCATGGCGG TGATCGAGCC GCAGGCCATG
CCGCTGACCA GCGAGGCTCC GGCCTCGCCG GTGCTGCGCG GCCAGGACGG CGACGTCAGC
CACGCCCCGG CCTTCCTGGG CCGCAAGGCG CCGCGCGCCG AAGCCCCGGT CCAGGCGGCT
CCCGTCGCGC CGTCGGCCGA CGAAGCGCCG GCCAAGCCCA AGCGTCGCCG CGCTCCGCGC
AGCTTCGAAG GCAGCGCCGC GCCGGAGTCG GAAGAGGTCT AG
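
The GC content field of a record like this can be recomputed from the gene sequence. This is a small sketch, assuming GC content is defined as the fraction of G and C bases over the full sequence and that the spaces and line breaks in the listing are layout only; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq):
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base groups of the gene sequence, as a short demonstration.
print(f"{gc_fraction('ATGAGAGATT TCAAGGGCAT'):.0%}")
```

Passing the complete gene sequence instead of the short demonstration string gives a value that can be compared with the reported GC content.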
 
Protein sequence
MRDFKGMKRQ RGRNNRGGAG SGGKPQQHNA NRAFDSNGPE GVKVRGAAQS VYEKYQQLAR 
DATSSGDRVL AENYLQHAEH YFRVLRAIQP NRPVSDIIGK DAYSAYEIDF EAEPEEQTEA
PEAAQSETQG DGDGGDQGQG EGRRDRFENR PRDDRPREDR QRDDRPRDGQ RDDRPRENRD
RFENQGQGQG QGQGQGRRDR WRDRDDRPRD GQRDDRPRDD RPREDRPRDD RPREDRPRDD
RPREDRFRDE RPRDDRPRED RPAVVEAAVE APVEARRERP RRERAPRDRD PMAVIEPQAM
PLTSEAPASP VLRGQDGDVS HAPAFLGRKA PRAEAPVQAA PVAPSADEAP AKPKRRRAPR
SFEGSAAPES EEV
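
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; table 11 also allows alternative start codons that are read as M at the first position, which this sketch does not handle (the gene here starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in T, C, A, G order paired with their amino acids ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGAGAGATT TCAAGGGCAT GAAACGCCAG"))  # MRDFKGMKRQ
```

The printed residues match the first ten residues of the protein sequence listed above; translating the full gene sequence allows the same comparison over the whole protein.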