Gene Caul_4498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4498 
Symbol 
ID5901959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4870980 
End bp4872377 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content71% 
IMG OID641565017 
Productthreonine synthase 
Protein accessionYP_001686116 
Protein GI167648453 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.395589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.347683 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTACG TCTCCACACG GGGTGAAGCC CCATCGATCG GTTTCCTTGA TGCGGTCCTG 
GCCGGCCTGG CCCCCGACGG CGGCCTCTAT GTGCCCGAGC GCTGGCCGAC CTTCACACCC
GATGAGATCG CGGCCTTCGC CGGCCAGCCC TACGCCCAGG TGGCCGCCAC GGTGATCGGC
AAGTTCGTCG GCGACGACCT GCCCGCCGAC GATCTGCGCG AGATGTGCGA AGAGGCCTAT
GCCAGCTTCG CCCACGCCGC CGTCACGCCG CTCAAGCAAC TGGCCCCCGG CCGCTACCTG
CTGGAGCTGT TCCACGGCCC GACCCTGGCC TTCAAGGACG TGGCGATGCA GCTGCTGGGG
CGGCTGTACG ACTACGTGCT GTCGCGCCAG TCGCGGACCA TGACCATCAT CTGCGCCACC
TCGGGGGACA CCGGCGGCGC GGCCGTCGAG GCCTTCCGCG GCCGCTCGCA CGCCCGCATC
GTGGCCCTGT TCCCCGAAGG CCGGATCAGC GAGGTCCAGC GTCGGTTCAT GACCACGGCC
ACCGACAAGA ACGTCGCCTG CGTCTCGATC CAGGGCTCGT TCGACGACTG CCAGGCCATC
GTCAAGCAGG CCTTCTCGGA CGACCAGTTC CGCCACGCCG TCGACCTGTC GGGCGTCAAC
TCGATCAACT GGGCGCGCAT CGCCGCCCAG ACCGTCTATT TCTTCACCGC CGCCGTCGCC
CTGGGCGCCC CGGCCCGCAA GGTGGCCTTC GTGGTGCCGA CCGGCAATTT CGGCGACGCC
TACGCCGCCT ATGTCGCCAG CCGGATGGGC CTGCCGATCG CCAGGATCGT GGCCGCCACC
AATTCCAACG ACATCCTGGC GCGGGCCTTC GAGGAGGGCC GCTACACGCG CGGCGCCGTG
GCCGCCACCC AGAGCCCGGC CATGGACATC CAGGTGGCCA GCAATTTCGA GCGGCTCTAT
TTCGAGGCCG TCGGCCGCGA CGGGGTCGAG ACCGGCCGGG CCTTCCGGGC CTTCGCCGGC
ACGGGGATGC TCGACATCCC ACCCAGCGCC CACGCCAAGA TGCGCGAGCT GTTCCAGGGC
GCGTCGGTCA GCGAGGCCGA CACCGCCAAG ACCATCCTGT CGACCCTGAA CGAAACCGGC
GAGCTGATCG ACCCGCACAC CGCCGTCGGC GTGGCCGCCG CGACCAAGCT GCGCCTGGCC
GACCCGACGA CCCCGGTCGT GGTGCTGTCC ACCGCCCACC CGGCCAAGTT CCCCGAAGCC
GTGCTGGCCG CCGCCGGCCT GACCCCGGCC ACGCCGCGCG CCACCCCGGA CCTGTCGAAG
AAGCCGGAGA AGTTCGACCG CCTGCCGGCC GACGCCGAGA CGGTGAAGGC CTTCGTGCGG
GTGTTCGCGG CGGGGTGA
 
Protein sequence
MRYVSTRGEA PSIGFLDAVL AGLAPDGGLY VPERWPTFTP DEIAAFAGQP YAQVAATVIG 
KFVGDDLPAD DLREMCEEAY ASFAHAAVTP LKQLAPGRYL LELFHGPTLA FKDVAMQLLG
RLYDYVLSRQ SRTMTIICAT SGDTGGAAVE AFRGRSHARI VALFPEGRIS EVQRRFMTTA
TDKNVACVSI QGSFDDCQAI VKQAFSDDQF RHAVDLSGVN SINWARIAAQ TVYFFTAAVA
LGAPARKVAF VVPTGNFGDA YAAYVASRMG LPIARIVAAT NSNDILARAF EEGRYTRGAV
AATQSPAMDI QVASNFERLY FEAVGRDGVE TGRAFRAFAG TGMLDIPPSA HAKMRELFQG
ASVSEADTAK TILSTLNETG ELIDPHTAVG VAAATKLRLA DPTTPVVVLS TAHPAKFPEA
VLAAAGLTPA TPRATPDLSK KPEKFDRLPA DAETVKAFVR VFAAG