Gene Acid345_4166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4166 
Symbol 
ID4072125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4931914 
End bp4933170 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content58% 
IMG OID637986197 
Productcysteine desulphurase-like protein 
Protein accessionYP_593240 
Protein GI94971192 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01976] cysteine desulfurase family protein, VC1184 subfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.744365 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.275915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGA GTGCACATGC CAGTGTGAAC CTGGAAGCGA TTCGAGCGCA ATTTCCTGCC 
CTCAGCCAAA CGTACAACGG GCATCCGCGA GTGTATTTCG ATGCGCCCGG TGGAACTCAG
GTACCGCAGC AGGTGATTGA CGCGATCTCG GGCTACCTGG TGCATTCGAA CTCGAACACG
CATGGGCAAT TCCATACCAG TCACCTGACC GACGAAGTGC TGGAGCACGC CCACGCAGCA
ATGGCCGACA TGCTTGGGTG CGATGCGGAT GAGATTGTGT TCGGACAGAA CATGACCACG
CTGACCTTTG CGTTGAGCCG CGCGCTGGGC CGCGATTTAC GCGCAGGGGA TGAGATCGTG
ACGACATTGC TCGATCACGA TGCGAATGTA GCGCCGTGGC GCGCGCTGGA AGAGACCGGG
GCGAGGGTGC ACGCGGTGAA GTTCCATCCC GAGGACTGCA CGCTGGATCT GGAGGATTTG
CAGTCGAAGC TGAACGGGCG GACGAAGATT GTGGCGGTGG GATTTGCGTC GAACGCGGTG
GGCACGATCA ATCCCATTAA AAAGATTGTG GAGATGGCGC ACGCGGTGGG AGCGTTGGTT
TTCGTGGACG CCGTGCACTT TGCGCCGCAT GGGTTCATCG ATGTGCGCGA TCTGGATTGC
GATTTTCTCG CGTGCTCGAC GTATAAGTTT TTTGGTCCGC ACATGGGGGT CCTATTTGGG
AAGCATGAGC ATCTGTTGCG GTTGAAGCCG TATAAGGTGC GTCCGGCGGC AGATACTTTG
CCGGACCGGT GGGAGACGGG CACCCTGAAC CATGAGTGCA TTGCGGGAAT CACGGCATGC
GTGGAGTACC TGGCCGATGT CGGGCTGAAG ACGGTGAAGC ATCCGGAGTC GCGGCGGGAT
GCGATTGCGG CGGCGTATGC GTGGATGAAA GAGCATGAGC ATGAATTGGC GAGGCAGCTT
ATCGGCGGAC TGCTGGAGAT TCCGGGGCTG ACGTTTTATG GGATCCGGGA TTTGAGCCGG
CTGGATGAGC GAACGCCAAC GGTGTCAATA CGAATGGCGA AACTTTCGCC AGCAGAGTTG
TCGAAGAAGC TTGGCGATCT CGGGATTTAT ACGTGGGATG GGAACTTCTA CGCGATCAAT
GTGACGGAAC AGTTGGGCGT GGAAGAAGAT GGCGGGATCC TACGGATCGG GTTGGCGCAT
TATGCAACTT CAGCGGAAGT GGAAAGGTTG TTGAAGGCGC TGCGAGAGTG GGCATAG
 
Protein sequence
MATSAHASVN LEAIRAQFPA LSQTYNGHPR VYFDAPGGTQ VPQQVIDAIS GYLVHSNSNT 
HGQFHTSHLT DEVLEHAHAA MADMLGCDAD EIVFGQNMTT LTFALSRALG RDLRAGDEIV
TTLLDHDANV APWRALEETG ARVHAVKFHP EDCTLDLEDL QSKLNGRTKI VAVGFASNAV
GTINPIKKIV EMAHAVGALV FVDAVHFAPH GFIDVRDLDC DFLACSTYKF FGPHMGVLFG
KHEHLLRLKP YKVRPAADTL PDRWETGTLN HECIAGITAC VEYLADVGLK TVKHPESRRD
AIAAAYAWMK EHEHELARQL IGGLLEIPGL TFYGIRDLSR LDERTPTVSI RMAKLSPAEL
SKKLGDLGIY TWDGNFYAIN VTEQLGVEED GGILRIGLAH YATSAEVERL LKALREWA