Gene Noca_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0939 
Symbol 
ID4597472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp985091 
End bp986479 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content71% 
IMG OID639775542 
Productcystathionine beta-synthase 
Protein accessionYP_922149 
Protein GI119715184 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG0031] Cysteine synthase
[COG3620] Predicted transcriptional regulator with C-terminal CBS domains 
TIGRFAM ID[TIGR01137] cystathionine beta-synthase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.44231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACTCAC TCCTCGACCT GATCGGCAAC ACCCCGCTGC TGAGGCTCTC GACGTCCATG 
GGCTCCCTGA ACGGCGCGAA GGGACCGATC GTCCTCGCCA AGGTGGAGTA CCTCAACCCC
GGCGGCTCCG TGAAGGACCG CATCGCCACC CGGATGATCG AGGCGGCCGA GGCGTCCGGG
GAGCTTCAGC CCGGCGGCAC CATCGTCGAG CCGACGTCCG GCAACACCGG CGTCGGGCTG
GCGATGGTCG CCCAGGCGAA GGGCTACAGG TGCGTCTTCG TCTGCCCGGA CAAGGTCAGC
GAGGACAAGC GCAACGTGCT GAAGGCGTAC GGCGCGGAGG TGGTCGTCTG CCCGACCGCG
GTCGAGCCGG AGCACCCCGA CTCCTACTAC AACGTCTCCG ACCGGCTCGC CTCGCAGCCG
GGTGCCTGGA AGCCGGACCA GTACTCCAAC CCGCACAACC CGCGGTCGCA CTACGAGACG
ACCGGCCCGG AGATCTGGGC GCAGACCGAG GGGCGGGTCA CCCACTTCGT CGCCGGCGTC
GGCACCGGCG GCACCATCAG CGGCACCGGG CGCTACCTCA AGGAGCAGAA CTCCTCGGTC
CAGGTCATCG GGGCCGACCC GGCGGGCTCG GTCTACTCCG GCGGCACCGG CCGGCCCTAC
CTCGTCGAGG GAGTGGGCGA GGACTTCTGG CCGGAGGCCT ACGATCGCGA CGTCGCCGAC
CGGATCATCG AGGTCTCCGA CGCCGACTCG TTCGCGATGA CGCGGCGGCT GGCCCGCGAG
GAGGCCCTGC TGGTCGGCGG TTCCTCCGGC ATGGCCGTGC ACGCGGCGGT CCAGCTCGCC
CACGAGCTCG CCGGCACCCC CGAGGGCGAG GACGCGGTGA TCGTCGTACT CCTCCCGGAC
TCCGGCCGCG GCTACCTCAC GAAGGTCTTC AACGACGACT GGCTCGCGCA GTACGGATTC
CCGGTCGACG GCGCCGAGCG CTCCGTGCAG TCCGTCGGGG AGGTGCTCCG CGGCAAGAGC
GGGCGGCTGC CCGACCTCGT GCACACCCAC CCGAACGAGA CCATCGCCGA AGCCGTCGCG
ATCCTCCAGG AGTACAACGT CTCCCAGATG CCGGTCGTGC GCGCGGAGCC TCCGGTGGTG
GCCGCCGAGG TCGTCGGATC GGTCTCCGAG CGGACCCTGC TCGACCTGCT GTTCACCGGC
TCGGCCAAGC TCACCGACAG CGTCGGCGAG CACATGGCGC CCCCGCTGCC GACGATCGGC
TCCACCGAGC CCGCCTCCGA GGCCGTCGCC GCACTCGAGG GCGCCGACGC CCTGTTGGTG
CACGAGGACG GCAAGCCCGT CGGCGTCGTC ACCCGCCACG ACCTGCTGGC CTACCTCGCG
CGCGGCTGA
 
Protein sequence
MNSLLDLIGN TPLLRLSTSM GSLNGAKGPI VLAKVEYLNP GGSVKDRIAT RMIEAAEASG 
ELQPGGTIVE PTSGNTGVGL AMVAQAKGYR CVFVCPDKVS EDKRNVLKAY GAEVVVCPTA
VEPEHPDSYY NVSDRLASQP GAWKPDQYSN PHNPRSHYET TGPEIWAQTE GRVTHFVAGV
GTGGTISGTG RYLKEQNSSV QVIGADPAGS VYSGGTGRPY LVEGVGEDFW PEAYDRDVAD
RIIEVSDADS FAMTRRLARE EALLVGGSSG MAVHAAVQLA HELAGTPEGE DAVIVVLLPD
SGRGYLTKVF NDDWLAQYGF PVDGAERSVQ SVGEVLRGKS GRLPDLVHTH PNETIAEAVA
ILQEYNVSQM PVVRAEPPVV AAEVVGSVSE RTLLDLLFTG SAKLTDSVGE HMAPPLPTIG
STEPASEAVA ALEGADALLV HEDGKPVGVV TRHDLLAYLA RG