Gene Namu_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4420 
Symbol 
ID8450046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4904157 
End bp4905326 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content71% 
IMG OID645043467 
ProductCystathionine gamma-synthase 
Protein accessionYP_003203696 
Protein GI258654540 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCTG CCCCCGGTCC CCACGACCCG CAGTCCCCGG GTTTTGCCAC CCGCGCCATC 
CACGCCGGCC AGGATCCGGA CCCGCTGACC GGAGCGGTGG CCGTGCCGAT CTACCAGACC
TCGACGTTCG CGCAGGACGA GGTCGGCCAG CCGCGGGCCG GCTACGACTA CTCCCGCGCC
GGCAACCCGA CCCGGACTGC GCTGGAACAG GCGCTGGCCG CCTTGGAGGG GGGTCGCTCG
GGTTTCGCCT TCGCCTCCGG GATGGCCGCC GCCGATACCT ACATCCGCGC CGCGCTGCGA
CCGGGCGACC ACCTGATCCT GCCCGACGAC GCCTACGGCG GCACCTTCCG CCTGGTCGAC
AAGATCTGCG TGCCATGGGG TCTGACCTAC TCGACGGTGT CGCTGGGCGA CCTGGCCGCG
GTCCGCGCGG CGATCCGGCC GACCACCAAG GTCATCTGGT GCGAGACGCC GACCAACCCG
CTGCTGGGCA TCGCCGACAT TGCCGCACTG GCCGAGATCG CCCACGAGAG CGGTGCGAAG
CTGTTGGTGG ACAACACCTT TGCCTCGCCC TACCTGCAGC AGCCGCTGGC TCTGGGAGCC
GACGTGGTGC TGCACTCGAC GACCAAGTAC GTGGGCGGGC ATTCCGACGT GATCGGCGGG
GCGTTGATCG TGGACGACCC GGAGTTGGCC GAGGCCCTGG CCTTCCACAG CAAGTCGATG
GGCGCGGTGC CCGGCCCGGT CGACGCCTGG CTGACCCTGC GCGGCGTGAA AACCCTGGCC
GTGCGGATGG ACCGGCACTG TGACAACGCC GAGCGGGTGG TGGAGCTGCT GGTCGGGCAC
CCCCGGGTGG CCCGGGTCTA CTACCCGGGC CTGCCCGCCC ATCCGGGTCA CGCGATCGCG
GCGCGGCAGA TGCGCCGGTC CGGCGGCATG GTGTCCTTCA GCGTGGTCGG CGGGCAGGAG
GAGGCCCTCA AGGTGTGCCG GCGAACCCAG TTGTTCACGC TGGGGGAGTC GCTCGGTGGG
GTGGAATCGT TGATCGAGCA TCCCGGGCTG ATGACCCACG CCAGCGTCGC CGGCTCCGCG
CTGCAGGTGC CGGACGACTT GATCCGGCTC TCCGTCGGCA TCGAGGACGC CGACGACCTG
CTGGCCGACC TGCGGGACGC CCTCGACTGA
 
Protein sequence
MTPAPGPHDP QSPGFATRAI HAGQDPDPLT GAVAVPIYQT STFAQDEVGQ PRAGYDYSRA 
GNPTRTALEQ ALAALEGGRS GFAFASGMAA ADTYIRAALR PGDHLILPDD AYGGTFRLVD
KICVPWGLTY STVSLGDLAA VRAAIRPTTK VIWCETPTNP LLGIADIAAL AEIAHESGAK
LLVDNTFASP YLQQPLALGA DVVLHSTTKY VGGHSDVIGG ALIVDDPELA EALAFHSKSM
GAVPGPVDAW LTLRGVKTLA VRMDRHCDNA ERVVELLVGH PRVARVYYPG LPAHPGHAIA
ARQMRRSGGM VSFSVVGGQE EALKVCRRTQ LFTLGESLGG VESLIEHPGL MTHASVAGSA
LQVPDDLIRL SVGIEDADDL LADLRDALD