Gene Caul_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1661 
Symbol 
ID5899116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1739350 
End bp1740534 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content68% 
IMG OID641562150 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_001683288 
Protein GI167645625 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.224768 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGG ACCCGAAGGA CTGGGATATC GCCACCAAGC TGATCCGTGG CGGTCTGGCG 
CGTTCGCAAT TCATGGAGAC GGCCGAGGCC CTCTACCTCA CCCAGGGCTT CACCTATGAC
AGCGCCGAGG CCGCCGATCG CCGATTCTCG GGCGAGGATC CGGGCTTCGT CTATTCGCGG
TTCAACAATC CGACCGTGAA GATGTTCGAG GATCGCCTGG CGCTTCTGGA AGGCGCGGAA
GTGTGCCGCG CCCAGGCGAC GGGCATGGCC TCGATCCACG CCGCCCTGAT GGGCCTGGTC
CGGGCCGGCG ACCACGTGGT GGCCGGCCGC GCCCTGTTCG GCTCGTGCCG CTGGCTGATC
TCCGAATGGC TTCCGCGTTT CGGCGTCGAA ACGACCTTCG TCGACGCCAC CGATCCCAAG
GCCTGGGAGG CGGCGATCCG TCCGGGCACC AAGGCGGTGC TGATCGAGAC CCCGTCCAAC
CCCGTGCTGG AGATCACCGA CATCCGCGCG GTGGCGGAGA TGGCCCACGC GGTCGGCGCC
AAGGTCATCG TCGACAACGT CTTCGCCACC CCGATCTTCC AGCAGCCGCT GACGCTGGGC
GCCGACATCG TGGTCTATTC GGCCACCAAG CATATCGACG GCCAGGGCCG GGTGCTGGGC
GGGGCGATCC TGACCAGCGA GGCGATCAAC GAGGAATTCT ATCGCGACCC GCTGCGCCAC
ACCGGCCCGT CGCTGTCGCC GTTCAACGCC TGGGTGCTGG TCAAGGGCCT CGAAACCCTG
GACCTGCGCG TGCGCCGCCA GAACGACACC TGCGCGGCCC TGGCCGACCT GGTCGCCGAA
CACAAGGCGG TCAAGCAGGT GCTCTACCCG TTCCGCGCCG ACCACCCGGG CCACAATGTC
GCCAAGTCGC AGATGAGCGG CGGCGGCACC GTCCTGGCCC TCGACCTCGG CTCGCGCGAG
GCGGCGTTCA AGTTCCTCAA CGCCCTGGAG ATCGTCGACA TTTCCAACAA TCTGGGCGAC
GCCAAGTCGA TGGCCACCCA TCCGCCGACC ACCACGCACC GCTCCGTCCC CGAGGAGATG
CGCCCGTCGC TGGGCGTGGT CGAAGGCGGC GTGCGCCTGT CGGTCGGCCT GGAAAGCCTG
TCGGACCTGT CCCGAGATGT GATCCGTGCG CTCGATCAGG CGTGA
 
Protein sequence
MAEDPKDWDI ATKLIRGGLA RSQFMETAEA LYLTQGFTYD SAEAADRRFS GEDPGFVYSR 
FNNPTVKMFE DRLALLEGAE VCRAQATGMA SIHAALMGLV RAGDHVVAGR ALFGSCRWLI
SEWLPRFGVE TTFVDATDPK AWEAAIRPGT KAVLIETPSN PVLEITDIRA VAEMAHAVGA
KVIVDNVFAT PIFQQPLTLG ADIVVYSATK HIDGQGRVLG GAILTSEAIN EEFYRDPLRH
TGPSLSPFNA WVLVKGLETL DLRVRRQNDT CAALADLVAE HKAVKQVLYP FRADHPGHNV
AKSQMSGGGT VLALDLGSRE AAFKFLNALE IVDISNNLGD AKSMATHPPT TTHRSVPEEM
RPSLGVVEGG VRLSVGLESL SDLSRDVIRA LDQA