Gene Caci_2033 details


Gene Information

Locus tag: Caci_2033
Symbol:
ID: 8333377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 2303274
End bp: 2304845
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 68%
IMG OID: 644955183
Product: glycine dehydrogenase subunit 2
Protein accession: YP_003112794
Protein GI: 256391230
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID:
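
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2303274
end_bp = 2304845

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1572
print(protein_length)  # 523
```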


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00105152
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.440854
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGGT CTTCGTCGCA GGGTCCTGTT TCGGCGGAGG ACGCGCGGAT CGCGCCGAAG 
CCTCGGTTGC GGCGGTTCCA TCAGGCGCGG TGGGACGAGC CGTTGATTTT TGAGCTGAGC
AGTCCGGGGG AGCGGGGGGT CGGGGTTCCG GTCACCGACC TTCCGGTGCC TTCGCTGCCA
GCGGGGTTGG CGCGTGCTGC GGCGCCCTTG CTGCCGGAGA TGTCGCAGCC GCATGTGCTG
CGGCACTACA TGCGGCTTTC GCAGGAGACG TTGGGGGTCG ATCTGAACGT CGACATCGGG
CAGGGCACGT GCACGATGAA GTACAGCCCG AAGGTGAACG ACTCCTTCGT CCGGGACGCG
CGGATCGCCG AGCTGCATCC GTTGCAGGAC GAGGGGACGG TGCAGGGCGT GCTGGAGATT
CTGTATCGGC TGGAGGGGTT GCTGAAGGAG ATATCCGGGA TGGACCGGGT GTCGTTGCAG
CCGGGATCGG GGTCCTCGGC GATCTATGCG AACGTGTCGA TGATCCGGGC GTACCACGCC
TCGCGGGGGG AAGGCGAGCT GCGGGACGAG GTCATCACGA CGCAGTTCTC GCACCCGACG
AACGCGGCGG CGCCGAAGAC CGCCGGGTAC CGCGTCATCA CCCTGATGCC GGACGCCGAC
GGGTATCCGG ACATCGAGGC GCTGCGGGCG GCGGTCGGCC CGCGGACGGC GGCACTGCTC
ATCACGAACC CCGAGGACAC GGGCATCTTC AACCCGCGCA TCGAGGAGTT CGTGCGGCTG
GTGCACGAGG CCGGCGGCCT GTGCTGCTAC GACCAGGCGA ACGCCAACGG GATCCTGGGG
ATCACGCGCG CTCGCGACGC CGGCTTCGAC CTGTGCCACT TCAACCTGCA CAAGACGTTC
TCCACACCGC ACATGTGCGG CGGTCCGGCG GCAGGCGCGT CCGCGGTGAC ATCGGCGCTG
GAACCCTTCC TCCCGCGACC GACCGTGGAG TTCGACGGGA CACGGTACCG ACTGGACGAC
GACCGCCCGG AGTCCATCGG GAAGATCCGC CCCTTCTACG GCGTGGTACC GAACCTCGTA
CGCGCCTACG CATGGATCAT GGCCCTCGGC GGAGAAGGCC TACGCACGGT CGCCGAGACA
GCGGCACTGA ACAACAACTA CTTGATTTCA AAGGTGCTGC AGATCAAGGG CGTCTCGCTG
CCCTACGCAC AGGGCCGGCG CCGAGTGGAG CAAGCACGCT ACAGCTGGCA GAAGCTGAAC
GCGGACACCG GCATCCACTC CGAGGAACTC GGCTACCGCG TAGCGGACTT CGGCACCCAC
TACTGGACCA GCCACCACCC CTACCTGGTC CCGGAACCCA TGACCCTCGA GCCGACGGAG
TCCTACTCGC AAGCGGACCT GGACGAATAC GCGGCGATCC TGGCCGAGGC CGCACGCGAG
GCCTACGAGG ACCCCGAGCT GGTCCGCAGC GCACCCCACA ACGGCCCGAT CCACCGCATG
CGAGACGCCT CGCTGGAGGA CCCGGGAACG TGGGCGGTGA CGTGGCGCGC GTACCGACGG
AAGCTCGGGT GA
 
Protein sequence
MSGSSSQGPV SAEDARIAPK PRLRRFHQAR WDEPLIFELS SPGERGVGVP VTDLPVPSLP 
AGLARAAAPL LPEMSQPHVL RHYMRLSQET LGVDLNVDIG QGTCTMKYSP KVNDSFVRDA
RIAELHPLQD EGTVQGVLEI LYRLEGLLKE ISGMDRVSLQ PGSGSSAIYA NVSMIRAYHA
SRGEGELRDE VITTQFSHPT NAAAPKTAGY RVITLMPDAD GYPDIEALRA AVGPRTAALL
ITNPEDTGIF NPRIEEFVRL VHEAGGLCCY DQANANGILG ITRARDAGFD LCHFNLHKTF
STPHMCGGPA AGASAVTSAL EPFLPRPTVE FDGTRYRLDD DRPESIGKIR PFYGVVPNLV
RAYAWIMALG GEGLRTVAET AALNNNYLIS KVLQIKGVSL PYAQGRRRVE QARYSWQKLN
ADTGIHSEEL GYRVADFGTH YWTSHHPYLV PEPMTLEPTE SYSQADLDEY AAILAEAARE
AYEDPELVRS APHNGPIHRM RDASLEDPGT WAVTWRAYRR KLG
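
The protein sequence follows from the gene sequence under translation table 11, which uses the standard codon assignments and allows alternative start codons such as GTG to code for an initial methionine. The following is a minimal sketch of that translation, not the tool used to produce this record. The function name `translate_cds` and the reduced set of start codons are assumptions made for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial initiator codons are handled;
# table 11 permits a few more.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine at position 1.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGAGCGGGT CTTCGTCGCA GGGTCCTGTT TCGGCGGAGG ACGCGCGGAT CGCGCCGAAG"
print(translate_cds(first_line))  # MSGSSSQGPVSAEDARIAPK
```

The output matches the first 20 residues of the protein sequence, and the final codons AAG CTC GGG TGA translate to KLG followed by a stop, which agrees with the end of the protein.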