Gene YpAngola_A3807 details

Gene Information

Locus tag: YpAngola_A3807
Symbol: metB
ID: 5802284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4040144
End bp: 4041304
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 52%
IMG OID: 641341604
Product: cystathionine gamma-synthase
Protein accession: YP_001608116
Protein GI: 162421237
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR02080] O-succinylhomoserine (thiol)-lyase
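
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The Python sketch below assumes that Start bp and End bp are inclusive coordinates and that the gene length includes one terminal stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4040144, 4041304)
print(length)                  # 1161
print(protein_length(length))  # 386
```

Both values match the Gene Length and Protein Length fields of this record.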


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGTA AACAGGCAAC AATAGCAGTC CGTAGCGGGT TGAATGATGA CGAGCAATAC 
GGCTGCGTTG TCCCCCCGAT TCACCTTTCC AGTACCTACA ATTTTATTGA TTTTAATCAG
CCGCGCACGC ATGACTATTC ACGTCGTGGT AATCCAACGC GTGATGTTGT CCAACGGGCG
CTGGCGGAAT TGGAAGGGGG GGCCGGTGCC GTCATGACCA GCAGCGGGAT GTCGGCGCTT
CATTTGGTTT GCACTACATT CTTACAGCCG GGCGATCTGT TGGTCGCTCC GCATGACTGT
TACGGTGGCA GTTACCGTTT ATTTGACAGC TTGAGCAAGC GTGGTGCGTA TCGGGTGTTA
TTTGTTGATC AGGGCGATGA AGCGGCACTA AACTGCGCAT TGGAGGAGAA ACCGAAGTTG
GTCTTGATTG AAACACCGAG TAATCCATTG CTACGGGTTG TTGATATTGC CGCCATCTGC
CAAGCCGCCC GTGCTGCGGG CGCACTGACG GTTTGTGATA ACACCTTCCT CAGCCCCGCC
TTACAGCAGC CTCTCTCTCT TGGGGCCGAT TTAGTGGTGC ACTCCTGTAC CAAATATCTC
AATGGTCACT CTGATGTGGT GGCTGGTGCT GTTATTGCGA AAGATCCAGA ACTGGTTGTC
GAGCTGGCAT GGTGGGCAAA TAATATTGGT GTAACCGGTG CTGCGTTTGA CAGCTATCTA
CTCCTTCGTG GTTTACGCAC GTTATCACCA CGCATGGCTC AACAGCAGCG TAACGCGGAT
GACATTGTGC GTTATTTACA GCAACAGCCT TTAGTGAAAA AGCTGTATCA TCCTTCCCTG
CCACAACATC CCGGCCACGA AATAGCCTGC CGTCAGCAAT CAGGTTTTGG TGCAATGCTC
AGTTTTGAGC TGGATGGTGA TGAGCAGGTC ATGCGCCGTT TCCTTTCTGC CCTTGAGCTA
TTTACCTTGG CAGAGTCTTT GGGGGGGGTA GAAAGCCTGA TCTCCCATGC AGCGACCATG
ACCCACGCGG GTATGGCGGC AGAGGCGCGT ATTGCCGCAG GCATTACTGA TAGTTTGTTG
CGTATTTCCG TGGGTATTGA AGACAGTGAA GATTTGATTG CTGATTTGGA CCACGCGTTC
CAATTGGCAG TAACGAGGTA A
 
Protein sequence
MTRKQATIAV RSGLNDDEQY GCVVPPIHLS STYNFIDFNQ PRTHDYSRRG NPTRDVVQRA 
LAELEGGAGA VMTSSGMSAL HLVCTTFLQP GDLLVAPHDC YGGSYRLFDS LSKRGAYRVL
FVDQGDEAAL NCALEEKPKL VLIETPSNPL LRVVDIAAIC QAARAAGALT VCDNTFLSPA
LQQPLSLGAD LVVHSCTKYL NGHSDVVAGA VIAKDPELVV ELAWWANNIG VTGAAFDSYL
LLRGLRTLSP RMAQQQRNAD DIVRYLQQQP LVKKLYHPSL PQHPGHEIAC RQQSGFGAML
SFELDGDEQV MRRFLSALEL FTLAESLGGV ESLISHAATM THAGMAAEAR IAAGITDSLL
RISVGIEDSE DLIADLDHAF QLAVTR
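
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The Python sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied as displayed on the page.
first_row = "ATGACGCGTA AACAGGCAAC AATAGCAGTC CGTAGCGGGT TGAATGATGA CGAGCAATAC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Passing the full gene sequence text through clean_sequence should give a string of 1161 bases, per the Gene Length field.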