Gene Caul_5003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5003 
Symbol 
ID5902465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5403891 
End bp5404940 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content68% 
IMG OID641565524 
Productdehydratase 
Protein accessionYP_001686621 
Protein GI167648958 
COG category[I] Lipid transport and metabolism 
COG ID[COG2030] Acyl dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGA CCGATCCAGG GAACTATTTC GAGGACTTCC GCCTGGGCCA GGTGATCGTC 
CACGCCACTC CGCGGACGAT CACGGCGGGC GACGTGGCGC TGTACACGGC GTTGTACGGA
CCGCGCTTTT CGCTGTTCTC GTCGGACGCC TTCGCGCGCG ATTGCGGGCT GGAGACCGCG
CCGGTCGATC CGTTGGTCGC CTTCCACGTG GTGTTCGGCA AGACCGTGCC CGACATCAGC
CTGAACGCCG TGGCCAATCT CGGCTACGCC GAGGGCCGGT TCCTGGCCCC CGTGCATCCG
GGCGACACCC TGGCGGCCAA GTCCGAGGTG ATCGGGCTGA AGGAGAACTC CAACGGCAAG
ACGGGCGTGG TCTATGTGCG CACCACCGGG ACCAACCAGC GGGGCGTGGC CGTGCTCAGC
TATGTGCGGT GGGTGATGGT GCGCAAGCGC GATCCGGGCG CGGTGGTCGA GGGGCAGAGC
ATCCCGGCGC TGGCCGGGGC CGTGGCCGCC GAGCACCTGA CCCCGCCGCC GGGCCTGAGC
TTTTCCAAGT ACGACTTCGC CCATGCCGGC GCGCCGCACG CCTTCGAGGA CTATGCGGTC
GGCGAGAAGA TCGACCATGT CGACGGCATG GTGGTCGAGG AGGCCGAGGC CCAGATGGCC
ACGCGGCTGT GGCAGAACAC CGCCAAGGTT CATTTCAACC AGTTCGAGCG CGCCAAGGAC
CCCTCGAGCC GGCGGCTGGT CTATGGCGGG GTGGTGATCT CGACGGCCAA GGCCCTGTCG
TTCAACGGGC TGCAGAACGC GGGCCTGATC CTGGCGATCA ATGGCGGCCG CCATGTCAGC
CCCTATTTCG CCGGCGGCAC GGTGTTCGCC TGGTCGGAAG TCCTGGACAA GGCCGACCTG
GGTCACGGGA TCGGCGCCTT GCGCCTGCGG CTGGTGGCGA CGGTCGATCG GCCTTGCGCC
GACTTCCCCG ACAAGGACGA GGCCGGGGCC TACGCGCCCG GCGTCATCCT CGACTTCGAC
TACTGGGCGG CGGTTCCCAA GCGTGGCTGA
 
Protein sequence
MSKTDPGNYF EDFRLGQVIV HATPRTITAG DVALYTALYG PRFSLFSSDA FARDCGLETA 
PVDPLVAFHV VFGKTVPDIS LNAVANLGYA EGRFLAPVHP GDTLAAKSEV IGLKENSNGK
TGVVYVRTTG TNQRGVAVLS YVRWVMVRKR DPGAVVEGQS IPALAGAVAA EHLTPPPGLS
FSKYDFAHAG APHAFEDYAV GEKIDHVDGM VVEEAEAQMA TRLWQNTAKV HFNQFERAKD
PSSRRLVYGG VVISTAKALS FNGLQNAGLI LAINGGRHVS PYFAGGTVFA WSEVLDKADL
GHGIGALRLR LVATVDRPCA DFPDKDEAGA YAPGVILDFD YWAAVPKRG