Gene Pden_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_3970 
Symbol 
ID4582521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008687 
Strand
Start bp1107420 
End bp1108631 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content67% 
IMG OID639771279 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_917732 
Protein GI119386677 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.615432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTCG ACGTCGAAAA GGTCCGCGCC GATTTCCCGA TCCTGTCGCG GCAGGTGAAC 
GGCCGGCCGC TGGTCTATCT GGACAACGGC GCCTCGGCGC AGAAACCGCG CGTGGTGATC
GACGCCATCA CCCGCGCCTA TGAGGCGGAA TATGCCAATG TCCATCGCGG GCTGCATTTC
CTGTCCAACC TCGCGACCGA GAATTACGAG CGCGTGCGCG CCATCATCGC CCGCTTCCTG
AACGCCCCGC GCGAGAACGA GGTGATCTTC ACCTCGGGCG CGACCGAAGG AATCAACCTC
GTCTCCTATG GCTGGGCCGC GCCCCGCCTG CAGGCCGGCG ACGAGATCGT GCTGTCGGTG
CTGGAGCATC ACGCCAATAT CGTGCCCTGG CATTTCCTGC GCGAGCGCCA GGGCGTGGTG
CTGAAATGGG TCGAGCCCGA GCCCGACGGC TCGCTGCCGC CCGAAAAGGT GCTGGCGGCG
GTGGGCCCGC GCACCCGGCT GATCGCCGTC ACGCATATGT CGAACGTGAC CGGCACCGTG
GTCGATGTCG GCGCCATCGC ACGCGGCACC TCGGTGCCGG TTCTGGCCGA CGGGTCGCAG
GCCGCCGTGC ATATGCCGGT GGACCTGTCC GCGCTCGGCG TCGATTTCTA CTGCATTACC
GGGCACAAGC TCTATGGCCC TTCGGGCTCG GGTGCGATCT GGATCCGGGC CGAGCGCCAG
GCCGAGATGC GCCCCTTCAT GGGCGGCGGC GACATGATCC GCACCGTCAC CCGCGAAAGC
ATCGACTATG CCGACCCGCC GCTGCGCTTC GAGGCCGGCA CGCCCGGCAT CGCCAACCAG
ATCGGGCTGG GTGCGGCGCT GGAATACCTG ATGGCGCTTG GCATGGAGAA TGTCGCCGCG
CATGAACGCG ACCTACGCGA CTATGCCCGC GACCGGCTGC GCAGCCTGAA CTGGCTGTCG
GTGCAGGGCG ACGCGGCGGA CAAGGGCGCG ATCTTCTCGA TGACGATGCA GGGCGCACAT
GCGCATGACA TCTCGACCAT CCTCGACAAG CGCGGCATCG CGGTGCGGGC GGGCACGCAT
TGCGCCATGC CATTGCTGGA TTTCTTCGGC GTCAGCGCCA CGGCGCGCGC CAGTTTCGCG
ATGTACAACA CCCGCGCCGA AGTCGATGCG CTGATCGACG GGCTGACCTT CTGCCGCGAG
CTTTTTGCCT GA
 
Protein sequence
MNFDVEKVRA DFPILSRQVN GRPLVYLDNG ASAQKPRVVI DAITRAYEAE YANVHRGLHF 
LSNLATENYE RVRAIIARFL NAPRENEVIF TSGATEGINL VSYGWAAPRL QAGDEIVLSV
LEHHANIVPW HFLRERQGVV LKWVEPEPDG SLPPEKVLAA VGPRTRLIAV THMSNVTGTV
VDVGAIARGT SVPVLADGSQ AAVHMPVDLS ALGVDFYCIT GHKLYGPSGS GAIWIRAERQ
AEMRPFMGGG DMIRTVTRES IDYADPPLRF EAGTPGIANQ IGLGAALEYL MALGMENVAA
HERDLRDYAR DRLRSLNWLS VQGDAADKGA IFSMTMQGAH AHDISTILDK RGIAVRAGTH
CAMPLLDFFG VSATARASFA MYNTRAEVDA LIDGLTFCRE LFA