Gene Caci_6634 details


Gene Information

Locus tag: Caci_6634
Symbol:
ID: 8337998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 7642861
End bp: 7643961
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 72%
IMG OID: 644959728
Product: glutamate--cysteine ligase GCS2
Protein accession: YP_003117321
Protein GI: 256395757
COG category: [S] Function unknown
COG ID: [COG2170] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02050] uncharacterized enzyme
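
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the CDS length includes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(7642861, 7643961)
print(length, protein_length_aa(length))  # prints: 1101 366
```

Both values agree with the Gene Length and Protein Length fields in the record.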


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.136582
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCGA CTGTCCCGAT CGACCAAGTA CTCAGCATCG GTGTCGAAGA GGAATTCGTC 
CTCGCCGACG CCACCACCCG GGTGTCGGCG CCGCGGGCCG ACGACGTCGT GGAGAAGGCG
CGGCTGCGGC TGGGCGACAA CGCTCAGCAC GAATTCTTCG CCACACAGGT GGAGTTCACC
ACGCGGCCTC GCATGACCGC CGAGGAGGTG CGCGCCGAAC TCGTCCGGGG ACGCCAGGCC
GGCGCGGCTG CCGCGGCGGA CACCGGCTGC CTGCTGGTCG CCGGGGGCAG CGCCGTGCTG
AACCGCTCGC CGCTGCCCGT CGCGCCGAAC GCCCGCTACG AGACCATCGC GCGCCGCCAC
CTCGGCGGCA TGCGCAGCGA GTCCAGCGGG TGCCACGTCC ACGTCGGTAC GCTGACGCGC
GGCGACGCGC TGCTGCTGAG CAACCACCTG GGACCGTGGC TGCCGGCCCT GCAGGCGTTG
TGCGTGAACT CGCCCTTCGC CGCCGGGGAG GACCGCCACT GCGCGAGCTG GCGCCACTTC
GACATCCAGG CGCTGCCGAC CGTCGGGCCG ACGCCGATCC TGGACGAGCC GGCCTACGAG
CGCACCGCGG ACAGGCTGGT CGCTGACAGG ACCCTGCTGG ACCGCAAGAT GATCTATTGG
TACGCCCGGC CGTCCGAGCA CTGTCCCACC TTGGAGATCC GGATCGCCGA CGCCAACCCC
GACCTCGACG TCGTCATGCT CTTCACGCTC CTGCTGCGCG GACTTGCGAC GACGTTGCTG
GCGGAGGCGC GGTACGGCCG TCCGTGGCCC AGTATGGACC GACGGTTGCT GACCGAGGCC
CACCGCAGGG TCGCGGTGGA CGGCCTGCCC GCCCTCACCA CCGATCCCCG GACCGGGATG
CTGATCTCGA CGGCCGCACT GCTGGACCGA CTGGTCGAGC GCAGCCGCCC GGGCCTGGCC
GCCGCGGGTG ACGAAGACCT CGTGGCAGCG CTGCTGGCCC GGTTCCACTC GCGCGGCACT
CCTGCCGACC GGCAGCGTGC CGTGTATCGG GAGCGTGGAC GTCTGGCCGA TGTCGTGGAC
TGGCTCGCGG TGCGGCCGTA G
 
Protein sequence
MAATVPIDQV LSIGVEEEFV LADATTRVSA PRADDVVEKA RLRLGDNAQH EFFATQVEFT 
TRPRMTAEEV RAELVRGRQA GAAAAADTGC LLVAGGSAVL NRSPLPVAPN ARYETIARRH
LGGMRSESSG CHVHVGTLTR GDALLLSNHL GPWLPALQAL CVNSPFAAGE DRHCASWRHF
DIQALPTVGP TPILDEPAYE RTADRLVADR TLLDRKMIYW YARPSEHCPT LEIRIADANP
DLDVVMLFTL LLRGLATTLL AEARYGRPWP SMDRRLLTEA HRRVAVDGLP ALTTDPRTGM
LISTAALLDR LVERSRPGLA AAGDEDLVAA LLARFHSRGT PADRQRAVYR ERGRLADVVD
WLAVRP
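
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation such as GC content. The following is a minimal sketch of that cleanup, not the method the database itself uses; `clean_sequence` and `gc_percent` are hypothetical helper names, and the short input shown is only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGGCCGCGA CTGTCCCGAT")
print(fragment)
print(round(gc_percent(fragment), 1))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_percent` gives a value that can be compared with the GC content field of the record.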