Gene Spro_4212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4212 
Symbol 
ID5604387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4669055 
End bp4670251 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content59% 
IMG OID640939772 
Productcystathionine beta-lyase 
Protein accessionYP_001480434 
Protein GI157372445 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01324] cystathionine beta-lyase, bacterial 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCAA AAAACATCGA AACCACCCTG ATCGGCGCAG GACGCAGCAA GCGCTATACC 
CAGGGTTCCG TCAATCCCGT CACCCAACGC GCTTCTTCAC TGGTATTTGA TTCGGTGGCG
GCCAAAAAGC ACGCCACTGC CAGGCGGGCC CATGGCGAAC TGTTTTATGG CCGGCGCGGC
ACCCTGACCC ACTTTGCGTT ACAGGATGCG ATGGTTGAGC TGGAAGGCGG CGCAGGTTGT
GTACTTTACC CTTGTGGCGC GGCTGCGGTC TCCAACGCTA TCCTTTCGTT TGTCAGCGCC
GGCGATCACC TGCTGATGAC CGGCTCGGTG TATGAACCCA CGCAGGATTT TTGCAGCCAT
ATTCTGAGCC GCATGAACGT GGCAACCACC TATTTCGACC CGCTGATCGG AGCCGATATC
GCCGGGCTTA TTCAGCCCAA TACCCGCGTG GTGTTTCTGG AGTCACCCGG ATCGATCACC
ATGGAAGTTC AGGATATCCC GGCCATGGTG CAGGCCATTC GCGCGGTGGC ACCGGAAGTG
GTGATCATGA TCGACAATAC CTGGGCGGCC GGTGTGCTGT TCAAGGCGCT GGACTTCGAC
ATTGATATTT CGATTCAGGC AGGGACCAAA TACATTATCG GCCACTCCGA TTACATGCTC
GGCACCGCCG TGGCAAACCA GCGCTGTTGG GATCAACTGC GCGAATACTC TTATCTGATG
GGCCAGATGG TGGACGCCGA TACCGCCTAT ATGGCCAGCC GTGGCCTGCG CACGCTGGGC
GTACGCCTGA AGCAACATCA GCAAGGCAGC ATCGAAGTCG CCAACTGGCT GGCCGAACGG
CCGGAGGTGG CAGTGGTCAA TCACCCGGAG CTGCCCAGCT GCAAGGGCCA CGAATTTTAC
CGCCGCGATT TCAGCGGCTG CAACGGCCTG TTCTCTTTCG TGCTGAAACA GCGTCTGAGC
GACGCACAGT TGGCCGAGTA TCTGGACAAC TTCAACCATT TCAGCATGGC CTATTCCTGG
GGCGGTTATG AATCGCTGAT CCTGGCCAAC CAACCGGAAG AACTGGAAGC CATTCGTCCG
GCCGGCGGCG TCGACTTCTC CGGCACCCTG GTGCGCCTGC ATATCGGACT GGAGAACGTG
CAGGATCTGA TTGACGACCT CGCCGCCGGA TTCGAGCGTA TCGCCAACGC ACACTGA
 
Protein sequence
MTSKNIETTL IGAGRSKRYT QGSVNPVTQR ASSLVFDSVA AKKHATARRA HGELFYGRRG 
TLTHFALQDA MVELEGGAGC VLYPCGAAAV SNAILSFVSA GDHLLMTGSV YEPTQDFCSH
ILSRMNVATT YFDPLIGADI AGLIQPNTRV VFLESPGSIT MEVQDIPAMV QAIRAVAPEV
VIMIDNTWAA GVLFKALDFD IDISIQAGTK YIIGHSDYML GTAVANQRCW DQLREYSYLM
GQMVDADTAY MASRGLRTLG VRLKQHQQGS IEVANWLAER PEVAVVNHPE LPSCKGHEFY
RRDFSGCNGL FSFVLKQRLS DAQLAEYLDN FNHFSMAYSW GGYESLILAN QPEELEAIRP
AGGVDFSGTL VRLHIGLENV QDLIDDLAAG FERIANAH