Gene Caul_3180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3180 
Symbol 
ID5900635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3443909 
End bp3444910 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content70% 
IMG OID641563684 
ProductD-cysteine desulfhydrase 
Protein accessionYP_001684805 
Protein GI167647142 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2515] 1-aminocyclopropane-1-carboxylate deaminase 
TIGRFAM ID[TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.526016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCTGG CCCGTTTCCC CCGCGCCCGT TTCGCCCACC TGCCCACGCC CCTGGAGCCC 
CTGCCCCGCC TGGGCGCGGA GCTCGGGATC GACCTGTGGG TCAAGCGCGA CGACTGCACC
GGCCTGGCCG GCGGCGGCAA CAAAACCCGC AAGCTGGAGT TCCTGCTGGG CGAGGCGCTC
GCGCAAGGCG CCGACACCCT GGTGACGCAG GGCGCGGTGC AGTCCAACCA CGTGCGCCAG
ACCATCGCCG CCGGGGTCCG GTTCGGCCTG AAGAGCGAGA TCATCCTGGA GGAGCGCACA
GGCTCCAAGG CCAGCGACTA TACCGGCAAC GGCAATGTGC TGCTCGACCG GCTGATGGGC
GCCTCGATCC GCTTTGTGCC CGGTGGGACC GACATGGTCG AGGAGCTGGA GATTTCAGCG
GCGAGGGTGC GCCAGCGCGG CGGCAAGCCC TATGTCATCC CCGGCGGCGG CTCCAACACG
GTCGGGGCGC TGGGCTATGT CGATTGCGCC CGCGAACTGG TGGTGCAGGC CGACGCCATG
GATCTGAAGA TCGACCGTCT GGTCACCGCC ACCGGCAGCG CCGGCACCCA CGCCGGCCTG
GTCGCGGGCT TCGCGGCGCT CAGCGTCGAC ATCCCGATCC TGGGCTTTGG CGTGCGCGCC
CCTAAGGCCA GGCAGGAGGA AAACGTCTTC AACCTGGCGG TCGCCACGGC CGAGACCATC
GGCGCCGGCG GACGGGTGAC GCGGGACAGG GTGATCGCCG ACTGCGACTA TGTCGGCGCG
GGCTACGGCC TGGTCGACCA GGGGGTGATC GACGCCCTGA CCCTGGCGGC TCGCACCGAG
GGCCTGCTGC TGGATCCGGT CTACTCCGGC AAGGCGATGA AGGGCCTCAT CGACCAGGCC
CGCAAGGGCG CGTTCAAGGG CGAGCGGGTG GTGTTCCTGC ACACCGGCGG GGCGCAGGGG
TTGTTTGGGT ATCAAAGCGA ACTGGAGGCG GCCCTTGTCT AA
 
Protein sequence
MHLARFPRAR FAHLPTPLEP LPRLGAELGI DLWVKRDDCT GLAGGGNKTR KLEFLLGEAL 
AQGADTLVTQ GAVQSNHVRQ TIAAGVRFGL KSEIILEERT GSKASDYTGN GNVLLDRLMG
ASIRFVPGGT DMVEELEISA ARVRQRGGKP YVIPGGGSNT VGALGYVDCA RELVVQADAM
DLKIDRLVTA TGSAGTHAGL VAGFAALSVD IPILGFGVRA PKARQEENVF NLAVATAETI
GAGGRVTRDR VIADCDYVGA GYGLVDQGVI DALTLAARTE GLLLDPVYSG KAMKGLIDQA
RKGAFKGERV VFLHTGGAQG LFGYQSELEA ALV