Gene Caul_3015 details


Gene Information

Locus tag: Caul_3015
Symbol:
ID: 5900470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3282712
End bp: 3284001
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 70%
IMG OID: 641563516
Product: homoserine dehydrogenase
Protein accession: YP_001684640
Protein GI: 167646977
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
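
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3282712
end_bp = 3284001
gene_length_bp = 1290
protein_length_aa = 429

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp == gene_length_bp)                           # 1290 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 429 True
```

Under these assumptions the reported gene length (1290 bp) and protein length (429 aa) are consistent with the reported coordinates.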


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.330452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGA AAACCTGGCG CGTCGGAGTC GCCGGCCTCG GCACAGTCGG CGGGGGTCTG 
CTGCAGTTCC TGGCCGAGCA GCCGGACTTC GCCCCGGCCG GCGACCGGGC GGTGGTGACG
GCGGTCTCGG CGCGTTCGAA GTCGCGGCCG CGCACGATCG ACATCTCGGG CCTGACCTGG
TTCGACGATC CGGTGGCCCT GGCCTCGTCG CCGGACGTGG ACCTGTTCGT CGAGCTGGTC
GGCGGCAGCG ACGGCCCGGC AAAAGCGGCT GTCGAAGCCG CCTTGAAGCT GGGCAAGCCG
GTGGTCACCG CCAACAAGGC CCTGATCGCC GAGCACGGCG CCGAACTGGC CGCCCTGGCC
GAGGCCAACA ACGCCCCGCT GCTGTTCGAA GCCGCCGTGA TGGGCGGCAC GCCGGCGGTG
AAGATGCTGC GCGAGGCCAT GGTCGGCGAC GAGGTGGTCG GGGTGGCAGG CATCCTCAAC
GGCACCTGCA ACTTCATCCT CAGCGAGATG GAGAAGACGG GCCGCGCGTT CGCTGACGTG
CTGCGCGAGG CGCAAGGGTT GGGCTACGCC GAGGCCGACC CGACCATGGA CGTCGGCGGC
TTCGACGCCG GCCACAAGAT CAGCATCCTG GCGGCCCTGG CCTTTGGTTG CGCGCCAGAC
TTCGGCGCGG CCGAAATCGA GGGCATCAGC GACGTCGAGC TGCTCGACAT CAAGCTGGCC
AAGGACCTGG GCTATCGCAT CAAGCTGGTG GCCGGGGCCG CCAAGACCGA CGACGGCGTG
TCGGTGAAGG TGCATCCGTC CCTGGTGCCG CTGGAGCATC CGCTGGCCCA GGCCGGCGGG
GCGCTCAACG CCCTGTTCAT CGAGGGCAAG CGGATAGGCC GGATCTACAT CCAGGGGCCT
GGCGCGGGCG CGGGACCGAC CGCCGCCGCC GTGGCCGCCG ACATCGCCGA CGTGATGACC
GGCGCCAAGC GCCCGGTGTT CCAGGCCCCG GCCGGCCAGC TGAAGCCGTT CGTCGCCGTC
GATCCGGCCC GTTCGGTGGG CAAGGCCTAT CTGCGGATCA TGGTCCGCGA CGAGCCGGGC
GCCATCGCCG CCATCTCCGA GACCCTGGCC GAATGCGCCG TCTCGATCGA CAGCTTCCTG
CAAAAGCCCG TCGAGGGGGC GGGCGGCGTG CCGATCGTGC TCGTCACCCA TGCGACTCCC
GAATCCAATC TGCTGGATGC GATTAGCCGC ATCGAAAAAC TGCACGCCGT GCTAGAGCGT
CCCCGCCTTT TGCGCGTCGC GCGCATCTGA
 
Protein sequence
MTQKTWRVGV AGLGTVGGGL LQFLAEQPDF APAGDRAVVT AVSARSKSRP RTIDISGLTW 
FDDPVALASS PDVDLFVELV GGSDGPAKAA VEAALKLGKP VVTANKALIA EHGAELAALA
EANNAPLLFE AAVMGGTPAV KMLREAMVGD EVVGVAGILN GTCNFILSEM EKTGRAFADV
LREAQGLGYA EADPTMDVGG FDAGHKISIL AALAFGCAPD FGAAEIEGIS DVELLDIKLA
KDLGYRIKLV AGAAKTDDGV SVKVHPSLVP LEHPLAQAGG ALNALFIEGK RIGRIYIQGP
GAGAGPTAAA VAADIADVMT GAKRPVFQAP AGQLKPFVAV DPARSVGKAY LRIMVRDEPG
AIAAISETLA ECAVSIDSFL QKPVEGAGGV PIVLVTHATP ESNLLDAISR IEKLHAVLER
PRLLRVARI
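
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which codons may serve as start codons; this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are illustrative and not part of any database tooling. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
# Build the standard codon table in TCAG order (64 codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon (stop not included)."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACCCAGA AAACCTGGCG CGTCGGAGTC GCCGGCCTCG GCACAGTCGG CGGGGGTCTG"
last_line = "CCCCGCCTTT TGCGCGTCGC GCGCATCTGA"

print(translate(first_line))  # MTQKTWRVGVAGLGTVGGGL
print(translate(last_line))   # PRLLRVARI
```

Both outputs match the corresponding start and end of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give the value to compare against the reported GC content.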