Gene Cag_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1109 
Symbol 
ID3748327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1497068 
End bp1498324 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content46% 
IMG OID637773640 
Productdiaminopimelate decarboxylase 
Protein accessionYP_379414 
Protein GI78189076 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTACAGA GTACTTATTT TTCTTACTGT AATGGTGCCT TGCTTTGCGA TGGTGTGCCG 
CTTCAAACCT TAGCAGAGCA GTACGGAACG CCTCTTTATG TAACAAGCGA GCGAAGCCTT
ATTGAGAGTT ATGAGGCGTT TGAGCGTGCC TTTCAGCCAT TACATCACTT TACTTGTTAT
TCGGTTAAAG CGAATTATAA CCTTAGTGTT ATTAAAACTT TTGCGGCGCT TGGTTGCGGG
TGCGATGTAA ATTCAGGCGG TGAATTGTAT CGTTCGTTGC AAGCTGGTGT GGCGCCTGAC
CAAATAATTT TTGCCGGAGT TGGTAAAAAA TATGAAGAAA TTGCGTATGC TCTCGAGTCG
GGAGTTTTAA TGTTGAAGGC GGAATCGGTT TCGGAGCTTC ATGTTATTAA TCGCATTGCG
GCTGAGCAGG GTAAAATTGC CTCAATTGCG TTGCGTATTA ATCCAAACGT AACGGCTGAA
ACGCATCCTT ACATTACCAC AGGTGATAGC AAAGAGAAGT TTGGTATTGA TGAAGCTGAT
TTAGCAGATC TTTTTGCTTT GATTCGTCAG TTGCCACATG TGCGCTTAAT TGGGCTTGAT
ATGCACATTG GCTCACAAAT TTTTGATCCT GAGTATTATG TAGCAGCAAC CACAAAACTT
CTTGCTCTTT TCAATTTGTC GAAATCAATG GGGTTTGCTC TTGAGTACCT TGATATTGGT
GGTGGCTTTC CTGTAACCTA TACCGACACA AAGCATGCTA CGCCAATTGA GCGCTTTGCT
GAAAAGTTAG TTCCACTGTT GCAGCCACTT GGGGTAACGG TAATTTTTGA ACCTGGACGC
TATTTGGTGG CAAATGCGTC GGTATTGCTT ACGCGTATTT TGTACCGCAA GCGTAACCAT
GTTGGCAAGG AGTTTTTTGT GGTTGATGCT GGTTTAACCG AGCTGATTCG TCCAGCGCTT
TACCAATCGC ACCATGAGGT GCAAGCGGTG CAGCGCCAAG AGGCATCTGT GATTGCCGAT
GTGGTTGGTC CAGTATGTGA GTCAAGCGAC TTTTTTGCTC GCCAGCGTGA GCTTGATGCG
GTGCCTGAAG GTGGTTTGCT TGCGGTGATG TCGTGCGGGG CGTATGCTTC GGTTATGGGT
AGCAATTACA ATGGGCGTTT GCGTCCAGCC GAGGTAATGG TGCGCCGTAA TGGTGAGGTT
GTGCTTACGC GCCGTCGCGA AAGCTTTGAG CAGTTAATTC AGAATGAAGT GCTGTAA
 
Protein sequence
MLQSTYFSYC NGALLCDGVP LQTLAEQYGT PLYVTSERSL IESYEAFERA FQPLHHFTCY 
SVKANYNLSV IKTFAALGCG CDVNSGGELY RSLQAGVAPD QIIFAGVGKK YEEIAYALES
GVLMLKAESV SELHVINRIA AEQGKIASIA LRINPNVTAE THPYITTGDS KEKFGIDEAD
LADLFALIRQ LPHVRLIGLD MHIGSQIFDP EYYVAATTKL LALFNLSKSM GFALEYLDIG
GGFPVTYTDT KHATPIERFA EKLVPLLQPL GVTVIFEPGR YLVANASVLL TRILYRKRNH
VGKEFFVVDA GLTELIRPAL YQSHHEVQAV QRQEASVIAD VVGPVCESSD FFARQRELDA
VPEGGLLAVM SCGAYASVMG SNYNGRLRPA EVMVRRNGEV VLTRRRESFE QLIQNEVL