Gene Noca_2554 details

Gene Information

Locus tag: Noca_2554
Symbol:
ID: 4598411
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2723366
End bp: 2724652
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 70%
IMG OID: 639777160
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_923745
Protein GI: 119716780
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
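
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2723366
END_BP = 2724652
GENE_LENGTH_BP = 1287
PROTEIN_LENGTH_AA = 428


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the fields reported in the record.
coordinates_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coordinates_consistent, protein_consistent)
```

Under these assumptions, 2724652 - 2723366 + 1 gives 1287 bp, and 1287 / 3 - 1 gives 428 aa, so the reported fields agree with each other.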


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACGA CCCAGCTGGC CGGCCTGCTC CCCGAGCTGG AGGTGATCCG CGCGGACTTC 
CCGATCCTGG ATCGGACCCT CGCGGGTGGT CTGCCGCTGG TCTACCTCGA CAGCGCCAAC
ACCTCGCAGA AGCCGCAGGT CGTGATCGAC ACCATGGTCG ACCACCTCGA ACGGCACAAC
GCGAACGTCG CCCGGGCCAT GCACCAGCTC GGCGCGGAGT CGTCGGAGGC GTTCGAGGCG
GCGCGGGACA AGGTCGCGGC GTTCATCAAC GCGCCGAGCC GGGACGAGGT GATCTTCACC
AAGAACGCCT CCGAGGCGCT CAACCTGGTG GCCAACACCC TGTCCTGGGC GCGGGGCGAC
CGGGTGAACG GAGTCGGGGC TCTCGGCGCG GGCGACGAGG TCGTCATCAC CGAGATGGAG
CACCACTCCA ACATCGTGCC GTGGCAGCTG CTGACCGAGC GCACGGGCGC GACCCTGCGC
TGGTTCGGTC TCACCGACGA CGGCCGGCTC GACCTCTCGA ACATCGACTC GCTGATCACC
GAGCGGACCA AGGTGGTCGC CCTCACCTGG GTCTCGAACA TGCTCGGCAC GATCAACCCG
GTCGCCGAGA TCGCTCGGCG GGCCCACGAG GTCGGCGCGC TCGTGGTGGT CGACGCCTCC
CAGGCCGCTC CTCAGCTGCC CGTGGACGTG GTCGCGTCGG GTGCCGACCT GCTGGCGTTC
ACCGGCCACA AGGTCGTCGG GCCGACCGGC ATCGGCGTGC TCTGGGGCCG CCGCGAGGTG
CTCGACCAGC TACCGCCCTT CCTCGGTGGC GGCGAGATGA TCGAGACGGT GCGGATGGAG
CGCTCGACGT ACGCCCCGAT CCCGCACAAG TTCGAGGCCG GTACGCCGCC CATCGTCGAG
GCCGTGGGCC TCGGCGCAGC GGTCGACTAC CTCTCGATGA TCGGCCTGGA CGCGATCCAC
CGGCACGAGC AGGCGATCAC GGCGTACGCG CTCGACGGGC TCGCCACCGT GCCCGGCCTG
CGAGTCCTCG GTCCGCTCTC CGCGAAGGAC CGCGGCGGAG CGATCGCCTT CGAGATCGAC
GGGGTGCACC CGCACGACGT CGCGCAGGTG CTCGACTCGC GCGGGGTCGC GGTCCGGGCC
GGCCACCACT GCGCGAAGCC CGCGCACGCC CGCTTCGGCG TGCAGAGCTC CACCCGGATG
TCGTCGTACC TCTACACGAC GCCGGCGGAG ATCGACGCGC TCGTCGAGGC CCTGGAATAC
ACCCGCTCGT ACTTCAAGTT GGGTTGA
 
Protein sequence
MTTTQLAGLL PELEVIRADF PILDRTLAGG LPLVYLDSAN TSQKPQVVID TMVDHLERHN 
ANVARAMHQL GAESSEAFEA ARDKVAAFIN APSRDEVIFT KNASEALNLV ANTLSWARGD
RVNGVGALGA GDEVVITEME HHSNIVPWQL LTERTGATLR WFGLTDDGRL DLSNIDSLIT
ERTKVVALTW VSNMLGTINP VAEIARRAHE VGALVVVDAS QAAPQLPVDV VASGADLLAF
TGHKVVGPTG IGVLWGRREV LDQLPPFLGG GEMIETVRME RSTYAPIPHK FEAGTPPIVE
AVGLGAAVDY LSMIGLDAIH RHEQAITAYA LDGLATVPGL RVLGPLSAKD RGGAIAFEID
GVHPHDVAQV LDSRGVAVRA GHHCAKPAHA RFGVQSSTRM SSYLYTTPAE IDALVEALEY
TRSYFKLG
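
The sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any computation. The sketch below shows one way to clean such a block, compute its GC fraction, and translate it. It is a minimal illustration, not the database's own method. The codon table is built from the standard code; translation table 11 uses the same amino-acid assignments and differs only in permitting additional start codons, which this sketch does not handle specially. All helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
# Standard genetic code, codons enumerated in TCAG order.
# Table 11 shares these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = clean(
    "ATGACGACGA CCCAGCTGGC CGGCCTGCTC CCCGAGCTGG AGGTGATCCG CGCGGACTTC"
)
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Translating the first 60 bases this way should reproduce the first 20 residues of the protein sequence listed above. Applying the same functions to the full gene sequence allows a comparison with the reported protein sequence and GC content.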