Gene Caci_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1201 
Symbol 
ID8332536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1355909 
End bp1356919 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content74% 
IMG OID644954348 
Producthomoserine kinase 
Protein accessionYP_003111967 
Protein GI256390403 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0083] Homoserine kinase 
TIGRFAM ID[TIGR00191] homoserine kinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.326603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGCC CTGTCTTCCG AGCCGCACCG GTGCGCGTCC GCGTCCCGGC CACCAGCGCG 
AACCTCGGAC CCGGCTTCGA CTCCCTGGGC CTGGCGCTCG GGCTGTACGA CGAGGTGATG
GTCCGGATAG CCGATTCCGG GCTGCGGGTG GACGTCGCGG GGGAGGGCGC CGACACGGTC
GCGCGCGACG AGCGGCACCT GGTCGTGCGG GCCATGCGCG CGGCGTTCGA GCGGCTCGGC
GCGCGGCCGC CGGGGCTGGA GCTGGTGTGC GCCAACCGGA TCCCGCACGC TCGAGGGCTG
GGCTCCTCGG CGGCGGCGAT CTGCGCCGGG ATCGTCGCGG CGCGGGCCCT GACCGTCGGG
GCGACGCTGT CCGACGACGC CGTGCTGCAG CTGGCCACCG AGATGGAGGG GCATCCGGAC
AATGTGGCGG CCTGCCTGCG CGGCGGCTTC ACCATCGCCT GGTTGGACCA AGCGGGTGAA
ATCTCCGACG CCGTCGGCGC GACCGCGCGG GTGCTGGCGA TCGAGCCCGC GCCGAGCCTG
CGGGCCGTGG CGTTCGTGCC GGACGAGGGC CTGTCGACCG AGGTCGCGCG GGGTCTGCTG
CCCAAACTGG TGCCGCACGC CGAGGCCGCG CGCAACGCCG GACGGTCCGC TCTGCTGTCC
GCTGCGGTCG TGCAGGGGCG CGCCGACCTG CTGCTGGCGG CCACGCAGGA CCGCCTGCAC
CAGGACTACC GGGCGCCGGC CATGCCGCGG ACCGCGGCGC TGATCGCCGA GCTGCGCGGC
GCCGGACACG CCGCGGTGGT CTCCGGCGCC GGCCCGACGG TCCTGGTTCT GACGACGGAA
GACCAGGTCC AGACCGTGAT CGCGGACGGC ATGAAGGTCG CGCCGGCCGG CTGGCAGGCG
TTCGGCCTCG CAGTGGACAA CGCCGGTGCG GTATCCTTGA ACTCGACCGA AGGCGCGGGG
CGCGGGTTGG ATTCCGATAA GGATTCAAAC CCACGCCACG GGGGACTGTG A
 
Protein sequence
MPSPVFRAAP VRVRVPATSA NLGPGFDSLG LALGLYDEVM VRIADSGLRV DVAGEGADTV 
ARDERHLVVR AMRAAFERLG ARPPGLELVC ANRIPHARGL GSSAAAICAG IVAARALTVG
ATLSDDAVLQ LATEMEGHPD NVAACLRGGF TIAWLDQAGE ISDAVGATAR VLAIEPAPSL
RAVAFVPDEG LSTEVARGLL PKLVPHAEAA RNAGRSALLS AAVVQGRADL LLAATQDRLH
QDYRAPAMPR TAALIAELRG AGHAAVVSGA GPTVLVLTTE DQVQTVIADG MKVAPAGWQA
FGLAVDNAGA VSLNSTEGAG RGLDSDKDSN PRHGGL