Gene Cphy_2907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2907 
Symbol 
ID5743967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3565952 
End bp3567148 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content38% 
IMG OID641294007 
Productcysteine desulfurase NifS 
Protein accessionYP_001560004 
Protein GI160881036 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000201074 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAAAA TGATTTATTT AGATAATGCT GCTACCACTC AGACCAGACC GGAAGTAGTA 
GAGGCCATGT TACCATATTT TTATGAGAAT TATGGGAATC CTTCCAGCGT ATATGAAATC
GCGACGAGAA GTAGAAAAGC AGTAACAGAA GCAAGAGATA TCATTGCTAA GACCATAGGC
TGTGAGAATA ATGAGATTTA TTTTACCGCT GGTGGATCTG AGTCTGACAA CTGGGCGATT
AAAGGTGTTG CAGAAGCTTA TCGTGACAAG GGTAATCATA TTATTACATC TAAGATCGAA
CACCATGCGG TTTTGCATAC TTGTGAGTAC TTAGAGAAAC TTGGGTTTGA AGTTACTTAT
CTCGATGTGG ATGAAAGCGG AATTGTAAAG CTTGATCAAT TAAAAGCTGC GATTCGTCCA
ACCACTATCT TAATATCAAT TATGTATGCA AATAATGAAA TCGGTGCGAT TCAGCCTGTA
AAAGAAATTG GTGATATTGC GAAGCAGCAC AATATTTTAT TCCATACGGA TGCAGTTCAG
GCTTTTGGAC AGTTGCCAAT CGATGTGAAA GAACTTGGTA TTGATATGTT AAGTGCCAGT
GGTCATAAAT TAAATGGACC AAAGGGAATT GGTTTCCTCT ATATTAGAAA TGGCCTTAAG
GTACGTTCTT TTGTTCACGG CGGCGCTCAG GAAAGAAAGC GTAGAGCAGG TACTGAAAAT
GTACCAGGTA TTGTTGGATT TGGTAAGGCA GTTGAGCTTG CAGCATCCAA TTTAAAGGAA
AGAACCAAGA AGGAAATAGA ACTTCGAGAC TATCTTATTG AGCGAGTATT AAAAGAAGTT
CCTTACACTA GATTAAATGG ACATAGTAAG AATCGTTTAC CAAATAACGC AAACTTAAGC
TTCCAATTCA TCGAGGGAGA ATCTCTATTA ATCATGCTCG ATATGCAAGG AATTGCAGCA
TCCAGTGGTT CAGCTTGTAC TTCTGGATCA TTAGATCCTT CTCACGTTTT ATTGGCAATT
GGATTACCAC ATGAAATTGC ACATGGCTCA TTAAGATTAA CTCTAAGTGA GGACACAACA
AAAGAAGATA TCGATTTCAC AATCGATCAG ATAAAAGAGA TTGTAGATAA ATTAAGACAG
ATGTCACCAC TGTACGAAGA CTTTATGAAA AAGTTAGCAA AGAATCGTGC AGAATAA
 
Protein sequence
MGKMIYLDNA ATTQTRPEVV EAMLPYFYEN YGNPSSVYEI ATRSRKAVTE ARDIIAKTIG 
CENNEIYFTA GGSESDNWAI KGVAEAYRDK GNHIITSKIE HHAVLHTCEY LEKLGFEVTY
LDVDESGIVK LDQLKAAIRP TTILISIMYA NNEIGAIQPV KEIGDIAKQH NILFHTDAVQ
AFGQLPIDVK ELGIDMLSAS GHKLNGPKGI GFLYIRNGLK VRSFVHGGAQ ERKRRAGTEN
VPGIVGFGKA VELAASNLKE RTKKEIELRD YLIERVLKEV PYTRLNGHSK NRLPNNANLS
FQFIEGESLL IMLDMQGIAA SSGSACTSGS LDPSHVLLAI GLPHEIAHGS LRLTLSEDTT
KEDIDFTIDQ IKEIVDKLRQ MSPLYEDFMK KLAKNRAE