Gene Ent638_3256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3256 
Symbol 
ID5112970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3548535 
End bp3549740 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content57% 
IMG OID640493460 
Productcysteine sulfinate desulfinase 
Protein accessionYP_001177971 
Protein GI146312897 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily
[TIGR03392] cysteine desulfurase, catalytic subunit CsdA 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTT TCAGCCCTGC GCACTTTCGC GCGCAGTTTC CCGCGCTGGC CGATGCCGGT 
GTTTATCTTG ATAGTGCTGC CACGGCTTTA AAGCCGCTGG CCGTTATCGA GGCAACACAG
GATTTTTATA GCCTGAGCGC CGGAAACGTC CATCGTAGCC AGTTTGCAGA AGCCCAGCGA
TTGACCGCAC GTTACGAGGC CGCGCGCGAT CAGGTGGCTG AGTTGTTAAA TGCCGAAAGC
GGTAAAAATA TCGTCTGGAC GCGCGGCACC ACAGAAGCCA TCAATATGGT CGCGCAATGC
TACGCGCGCC CGCGCCTGCA GCCTGGTGAT GAAATTATTG TCAGCGAAGC AGAGCATCAC
GCTAACCTGG TGCCGTGGCT CATGGTTGCA GAACAAACCG GCGCGCGTGT AGTTAAGCTC
CCGCTGGGTG CAGATTTATT GCCAGATATT GCCTGCTTGC CTGACCTCAT CACCTCGCGC
AGTCGGATTC TGGCGCTGGG GCAGATGTCT AACGTTACGG GGGGCTGCCC TGATCTCGCA
CGTGCCATTG AGATTGCACA TGCGAACAAT GTTGTCGTGA TGGTCGACGG CGCACAGGGC
GTGGTGCATT TTCCGGCTGA CGTGCAAAAA CTGGATATCG ACTTCTACGC CTTCTCCGCG
CACAAACTCT ATGGCCCAAC GGGCATCGGC GCGTTGTATG GCAAAGCTGA ACTGTTGGCG
CAAATGAGCC CATGGTTGGG CGGTGGCAAG ATGATCACCG AGGTGACTTT CGACGGATTT
AAAACGCAAG AAATACCCTA TCGTCTGGAA GCCGGGACGC CAAACGTGGC GGGCGTGATT
GGCTTGAGCG CCGCACTGGA ATGGCTGTCG CAAACCGACG TTGTGCAAGC AGAGAACTGG
AGTCGCGGGC TGGCAACACT CGCTGAGGAA GAACTGAAAA AACGCCCTGG TTTTCGCTCT
TTCCGAGTAC AGGATTCCAG CCTGCTGGCG TTTGATTTTG CCGGGGTACA TCATAGCGAT
TTGGTGACTT TGCTCGCGGG TTACGGCATC GCATTACGCG CTGGACAACA TTGCGCCCAG
CCGCTTCTCG CCGCGCTCGG CGTAGACGGA ACGCTTCGCG CTTCTTTTGC GCCTTACAAT
ACGCAAAACG ACGTCGACGC CCTCGTTGCC GCCGTCGATC GTGCCCTTCA ACTTTTGGTG
GATTAA
 
Protein sequence
MNAFSPAHFR AQFPALADAG VYLDSAATAL KPLAVIEATQ DFYSLSAGNV HRSQFAEAQR 
LTARYEAARD QVAELLNAES GKNIVWTRGT TEAINMVAQC YARPRLQPGD EIIVSEAEHH
ANLVPWLMVA EQTGARVVKL PLGADLLPDI ACLPDLITSR SRILALGQMS NVTGGCPDLA
RAIEIAHANN VVVMVDGAQG VVHFPADVQK LDIDFYAFSA HKLYGPTGIG ALYGKAELLA
QMSPWLGGGK MITEVTFDGF KTQEIPYRLE AGTPNVAGVI GLSAALEWLS QTDVVQAENW
SRGLATLAEE ELKKRPGFRS FRVQDSSLLA FDFAGVHHSD LVTLLAGYGI ALRAGQHCAQ
PLLAALGVDG TLRASFAPYN TQNDVDALVA AVDRALQLLV D