Gene YpAngola_A3222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3222 
SymbolcsdA 
ID5801698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3411648 
End bp3412853 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content47% 
IMG OID641341050 
Productcysteine sulfinate desulfinase 
Protein accessionYP_001607575 
Protein GI162420213 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily
[TIGR03392] cysteine desulfurase, catalytic subunit CsdA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000462422 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.220152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTT TTAATCCAAT GGATTTTCGT CGGGAATTCC CTGCGCTCAG TGATAAATTA 
ACCTATCTGG ACAGTGCGGC GACCGCCTTG AAACCACGTG CAATGATTGA CGCGACACAG
CAATTTTATC AGCAGGATTC AGCAACGGTA CACCGCAGCC AACATCAATC GGCGCTGTCA
TTAACGGTTC GCTTTGAAAA CACCCGCCAA CAAGTGGCTG ATTTTATTAA CTCATCTACA
GCAGAAAATA TTATCTGGAC GCGAGGAACA ACTGAAGCCA TCAATCTGAT CGCGCAAAGT
TATGCCCGCC CCCGTTTACA ACCTGAAGAT GAAATTATTG TCAGCGAAGC TGAACATCAT
GCGAATTTAA TTCCCTGGTT GATGGTAGCG GAGCAGACCG GTGCAAAAAT AGTCAAATTA
CCTCTTGGCC TTGATCATCT GCCAGATTTA CAGCAACTCC CTCAACTACT TAATGAAAAA
ACACGCATAT TAGCGCTGGG GCAGATGTCT AACGTAACAG GCGGTAGCCC TGATCTGGCT
CAGGCTATTA GGCTGGCTCA CCAATATGAC TGTGTTGTCG TGGTTGACGG TGCTCAGGGA
ATTGTTCATT ACCCAGCCGA TGTTCAGGCA TTGGATATTG ATTTTTATGC ATTCTCTTCC
CATAAATTGT ATGGCCCAAC CGGCATTGGC GTGCTGTATG GGAAGACTGA ATTATTAGAA
GAGATGCCCG CCTGGCAAGG CGGCGGTAAA ATGCTTACCC ATGCATCATT CGGGGGCTTT
ACACCTCATG AAGTGCCTTA TCGCTTTGAA GCGGGTACAC CCAATATTGC TGGCGTTATT
GGTTTATCAG CGGTACTCAA ATGGCTGGAA CATATTGATC TGGAAGAGGC CGAAGTTTAT
AGCCAAGGTT TAGCTACAAT GGCAGAAAAT AAGCTCGCAC AATTACCGGG TTTTCACAGT
TACCGTTGTC AGCAATCCAG TTTATTAGCA TTTACTTTCG ATGGTGTTCA TCACAGTGAT
TTAGTGGCGT TATTGGCCGA GCAAGGTATC GCACTACGTG CTGGGCAGCA CTGCGCACAG
CCACTGATGG CGGCTCTGGG AGTCAATGGC AGTCTGCGGG CTTCTTTTGC GCCTTATAAT
ACCCCCCAAG ATGTTGAAAT GCTTTGCTCT GCGCTTGGTA AGGCATTGGA ACTGCTTCGA
GACTAA
 
Protein sequence
MKVFNPMDFR REFPALSDKL TYLDSAATAL KPRAMIDATQ QFYQQDSATV HRSQHQSALS 
LTVRFENTRQ QVADFINSST AENIIWTRGT TEAINLIAQS YARPRLQPED EIIVSEAEHH
ANLIPWLMVA EQTGAKIVKL PLGLDHLPDL QQLPQLLNEK TRILALGQMS NVTGGSPDLA
QAIRLAHQYD CVVVVDGAQG IVHYPADVQA LDIDFYAFSS HKLYGPTGIG VLYGKTELLE
EMPAWQGGGK MLTHASFGGF TPHEVPYRFE AGTPNIAGVI GLSAVLKWLE HIDLEEAEVY
SQGLATMAEN KLAQLPGFHS YRCQQSSLLA FTFDGVHHSD LVALLAEQGI ALRAGQHCAQ
PLMAALGVNG SLRASFAPYN TPQDVEMLCS ALGKALELLR D