Gene Ppha_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_2001 
Symbol 
ID6462966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp2089589 
End bp2090725 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content55% 
IMG OID642728200 
ProductCystathionine gamma-synthase 
Protein accessionYP_002018830 
Protein GI194337036 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.591425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTTTG AAACCCTTGC CATTCATGAC GGACAGGCTC CTGATCCCCA TACAGGATCA 
GTAACCGTCC CTGTTTACCA GGCCTCTACC TTCGAGCGTG CTGACCTGCA GCGTCACGGT
GAGTTCTTCT ATTCGAGAAT CGGCAATCCG ACGAGAGCGG CGCTCGAATC AACCATCGCA
CTGCTTGAAA ACGGTCGTTA TGGCCTCGCC TTCGCCTCCG GAGTTGCCGC TACCCTGGCC
GCGCTCCAGG TGGTCAAACC CGGAGAGCAT ATTGTGGCAA GCAACGATAT CTATGGGGGA
AGTTACCGCA TTTTTGAGCA ACTCCTCCGT CCGCTGGGGG TCACAACAAG CTATGCAGAA
AGTTCAGATA CCGCGAGTTA TAAGGCATGC ATCACCCCGG AAACCAGACT GATCTGGATT
GAAACCCCGA GCAACCCGCT TCTTCAGCTC TCCGACATCC GCGCTCTTGC CTCACTGGCA
AAAGAGCGCG GCATCCTGCT CGCGGTCGAC AACACCTTTT TAAGCCCCTA CTTCCAGCGT
CCGCTTGAAC TTGGAGCCGA TATTGTTGTC CACAGCACCA CAAAATATCT TGGCGGGCAC
AGTGACGTCA TCGGTGGCGC TGTCATCACC TCGGATGCAG CGCTGCACAC CATCATCAAA
AACTATCAGG CCGCAGCGGG AGCCATACCG GCGCCCTGGG ACTGCTGGCT CATTCTTCGT
GGATTGAAAA CGCTTAAAAT CCGCATGAAA GAGCATGAGG CAAACGCCCT GCATCTTGCC
AAATTTCTTG AACAGCACCC GGCAGTGGAG CAGGTTTTTT ATCCAGGCCT CCCATCCCAC
CCGCAGCATG AGCTGGCGAA ACAGCAGATG AGCGGGTTCG GGGGAATGGT AACCTTTGCC
CTTAAAGGGG GGCTTCCGGC TGTCGAACAA CTGGTCGCAC GAATCAAACT GTTTATCCTT
GCCGACAGCC TCGGCGGTGT AGAGTCGCTC ATCGCATCAC CGGCCAAAAT GACACTCGGA
GCCCTCTCGA TTGAGGAGCG TGCGCGGAGG AGATGTACCG ACAACCTCGT ACGCCTCTCT
GTCGGGCTTG AAAATGCAGA GGATCTGGAG GCGGATTTGT TGAACGCGAT TACGTAA
 
Protein sequence
MQFETLAIHD GQAPDPHTGS VTVPVYQAST FERADLQRHG EFFYSRIGNP TRAALESTIA 
LLENGRYGLA FASGVAATLA ALQVVKPGEH IVASNDIYGG SYRIFEQLLR PLGVTTSYAE
SSDTASYKAC ITPETRLIWI ETPSNPLLQL SDIRALASLA KERGILLAVD NTFLSPYFQR
PLELGADIVV HSTTKYLGGH SDVIGGAVIT SDAALHTIIK NYQAAAGAIP APWDCWLILR
GLKTLKIRMK EHEANALHLA KFLEQHPAVE QVFYPGLPSH PQHELAKQQM SGFGGMVTFA
LKGGLPAVEQ LVARIKLFIL ADSLGGVESL IASPAKMTLG ALSIEERARR RCTDNLVRLS
VGLENAEDLE ADLLNAIT