Gene Cag_2023 details

Gene Information

Locus tag: Cag_2023
Symbol:
ID: 3747996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2562192
End bp: 2563607
Gene length: 1416 bp
Protein length: 471 aa
Translation table: 11
GC content: 49%
IMG OID: 637774560
Product: aspartate kinase III
Protein accession: YP_380314
Protein GI: 78189976
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00656] aspartate kinase, monofunctional class; [TIGR00657] aspartate kinase
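
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2562192
end_bp = 2563607

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the script reproduces the 1416 bp gene length and 471 aa protein length listed in the record.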


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGTTA TGAAGTTTGG CGGGACGTCG GTTGGTAACG CCCGTGCAAT GCAACAAGCT 
ATTGCAATTG TTGCAAATAA AGAAAAAAGT GGTGCGCCAC TTGTTGTGCT GAGTGCTTGC
AGTGGCATTA CCAATAAGCT TATTCATATT GCTGATGCCG CTGGACGGAG TGCGCTTGCT
GAGGCTATGG TGTTGGCTGC CGAAGTTCGT GCTTTTCACC TTGCGTTAGC CGCTGAATTG
GTAACAACAC CTGAACGTCT TCATACACTC ACCACCTCCA TTACCGAGTT AGTTGATCGG
CTTGAAATGC TCATTAAGGG CGTTGATATT GTGGGTGAAC TTACGGAGCG TTCAAAAGAT
ATGTTTTGCT CGTTTGGCGA ACTCTTTTCA ACCACCATTT TTGCCGCTGC TATGCAAGAG
CGTGGGCATA ATGCTGCGTG GGTTGATGTG CGCACGGTAA TGATTACCGA CGATAATTTT
GGCTTTGCCC GTCCGCTTGA TAGCGTATGT GAAGCAAATG CTCTTTCGAT TATTCGCCCG
TTGCTTGAGC AAGGAACCAC CGTTGTTACG CAAGGCTACA TTGGTGCAAC GCGCGATGGA
CGCACCACAA CACTTGGGCG TGGTGGCTCC GACTTTTCTG CCGCCTTGCT TGGCGCATGG
CTTGATGATT CGGTGATTGA AATTTGGACG GATGTTGATG GCGTTATGAC CTGCGACCCT
CGCCTTGTGC CCGATGCGCG TAGCATTCGC GTGATGACCT TTACCGAAGC GGCTGAACTT
GCCTACCTTG GCGCTAAAGT GCTTCACCCC GACACCATTG CCCCTGCTGT TCAAAAAAAT
ATTCCTGTTT ACGTCCTCAA TTCCATTCAT CCTGAAGCAA AAGGCACCCT CATTACCAAC
GACTCGGAGC ACCTTTCGGG GATGAGTTAC GAAGGGTTAG TAAAGTCCAT TGCGGTGAAA
AAAAACCAGT GCATTATTAA TGTACGCTCT AACCGCATGA TGGGGCGCCA CGGTTTTATG
AGTGAACTTT TTGAACTCTT TGCCCACTAT GGTGTTTCGG TTGAAATGAT TTCAACCAGC
GAAGTTTCTG TTTCGCTGAC GGTGGATGAT AAATGCGTTA CGAGTGAGCT TATTCAAGCA
CTTGGCTCAC TTGGCGATAC TGAAATTGAG CACAATGTTG CCACCATTAG CGTGGTTGGC
GATAATTTGC GCATGTCGCG CGGTGTGGCT GGGCGCATTT TTAGTTCACT CAAAGAGGTG
AATTTGCGCA TGATTTCACA AGGCGCTTCG GAAATTAACG TTGGTTTTGT TGTTGATGAG
GCGGAAGTAG CCACTGCCGT AAATACGCTG CACAAGGAGT TTTTCTCCAC CCCAAATGAT
CACGCAATTT TTGAAAAACC GGCAGGGAGC CACTAA
 
Protein sequence
MAVMKFGGTS VGNARAMQQA IAIVANKEKS GAPLVVLSAC SGITNKLIHI ADAAGRSALA 
EAMVLAAEVR AFHLALAAEL VTTPERLHTL TTSITELVDR LEMLIKGVDI VGELTERSKD
MFCSFGELFS TTIFAAAMQE RGHNAAWVDV RTVMITDDNF GFARPLDSVC EANALSIIRP
LLEQGTTVVT QGYIGATRDG RTTTLGRGGS DFSAALLGAW LDDSVIEIWT DVDGVMTCDP
RLVPDARSIR VMTFTEAAEL AYLGAKVLHP DTIAPAVQKN IPVYVLNSIH PEAKGTLITN
DSEHLSGMSY EGLVKSIAVK KNQCIINVRS NRMMGRHGFM SELFELFAHY GVSVEMISTS
EVSVSLTVDD KCVTSELIQA LGSLGDTEIE HNVATISVVG DNLRMSRGVA GRIFSSLKEV
NLRMISQGAS EINVGFVVDE AEVATAVNTL HKEFFSTPND HAIFEKPAGS H
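
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the database's own pipeline: the function name `translate` is hypothetical, and the codon table is the standard one, which assigns the same amino acid to each codon as translation table 11 (table 11 differs in which codons may act as start codons). Only the first and last few bases of the gene are used here; whitespace between the 10-base groups is stripped before translating.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Amino-acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
head = "ATGGCGGTTA TGAAGTTTGG CGGGACGTCG"
# Last 6 bases of the gene sequence, including the TAA stop codon.
tail = "CACTAA"

print(translate(head))
print(translate(tail))
```

The first call yields the opening ten residues of the protein sequence (MAVMKFGGTS), and the second shows that the final TAA codon ends translation after the terminal H.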