Gene Ent638_1764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1764 
Symbol 
ID5113285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1917806 
End bp1919026 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content58% 
IMG OID640491953 
Productbifunctional cysteine desulfurase/selenocysteine lyase 
Protein accessionYP_001176494 
Protein GI146311420 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000868369 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000289207 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTTTCC CGGTAGAGAA AGTGCGGGCG GATTTCCCTG TCCTGACCCG TGAAGTCAAC 
GGTTTGCCGC TTGCCTATCT CGACAGCGCC GCCAGCGCGC AAAAGCCGAA TCAGGTGATT
GACGCCGAGA TGGAATTTTA CCGCCACGGC TATGCGGCCG TGCATCGCGG CATTCATACC
CTGAGCGCAG AAGCCACTCA GCGCATGGAA AATGTCCGCA CGCAGGTAGC GGCATTCCTG
AACGCCCGTT CAGCGGAAGA GCTGGTGTTT GTGCGCGGCA CAACAGAGGG GATCAACCTG
GTCGCCAATA GCTGGGGCAA TGCGCAGGTG CATGCGGGCG ATAATATCGT GATCACCCAG
ATGGAGCACC ACGCCAATAT CGTGCCGTGG CAGATGCTCT GTGAGCGCTC AGGCGCACAG
CTGCGCGTCA TTCCACTTAA CGTGGACGGC ACGTTGCAGC TGGAACAGCT CGACGCGTTG
CTTGACGCGC GTACGCGACT GGTGGCGATT ACGCAGATCT CTAACGTTCT TGGCACCGCG
AATCCGGTCG CAGAAATCAT TGCGAAAGCG CATCAGGCTG GCGCAAAAGT GCTGGTGGAT
GGCGCACAGG CCGTTATGCA TCACACTATT GACGTGCAGG CGCTGGACTG TGATTTTTAC
GTGTTTTCCG GTCACAAGCT GTATGGCCCA ACCGGAATCG GTGTGCTGTA TGTGAAAGAA
GATATTTTGC AGGCGATGCC GCCGTGGGAA GGGGGCGGAT CGATGATTGC GACCGTCAGC
CTGACGCAAG GCACGACCTA CGCCAAAGCC CCGTGGCGCT TTGAAGCGGG TACACCGAAT
ACGGGCGGGA TCATCGGGCT GGGTGCGGCA ATCGACTACG TTTCCACACT CGGTTTGGAT
GCTATCGCCG AGTATGAAGC GTCGCTGATG CGCTATGCGC TGGCGGAAAT GGCCAGCGTC
CCGGATCTCA CGCTGTACGG CCCTGACGCG CGTAAAGGCG TTATTGCCTT TAATCTGGGC
AAACATCACG CTTACGACGT GGGCAGTTTC CTTGATAATT ATGGCGTGGC GGTACGAACG
GGTCACCACT GCGCAATGCC GCTGATGGCG TTTTACCAGG TCCCGGCAAT GTGCCGCGCG
TCGCTGGTGA TGTATAACAC GACGGAAGAG GTCGACAGGC TGGTGACGGG GCTCAAACGC
ATCCATCATC TCCTGGGATA A
 
Protein sequence
MSFPVEKVRA DFPVLTREVN GLPLAYLDSA ASAQKPNQVI DAEMEFYRHG YAAVHRGIHT 
LSAEATQRME NVRTQVAAFL NARSAEELVF VRGTTEGINL VANSWGNAQV HAGDNIVITQ
MEHHANIVPW QMLCERSGAQ LRVIPLNVDG TLQLEQLDAL LDARTRLVAI TQISNVLGTA
NPVAEIIAKA HQAGAKVLVD GAQAVMHHTI DVQALDCDFY VFSGHKLYGP TGIGVLYVKE
DILQAMPPWE GGGSMIATVS LTQGTTYAKA PWRFEAGTPN TGGIIGLGAA IDYVSTLGLD
AIAEYEASLM RYALAEMASV PDLTLYGPDA RKGVIAFNLG KHHAYDVGSF LDNYGVAVRT
GHHCAMPLMA FYQVPAMCRA SLVMYNTTEE VDRLVTGLKR IHHLLG