Gene Cag_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1968 
Symbol 
ID3747830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2500292 
End bp2501587 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content49% 
IMG OID637774504 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_380259 
Protein GI78189921 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCAAA TCACCCGTTC TATTGAACTC TTTGAAAAAG CAAAGAAGTT TATCCCCGGT 
GGCGTTAACT CACCAGTACG CGCCTTTAAA TCTGTTGGCG GCACACCAAT TTACATGGCA
AAAGGCTCTG GCGCTTACAT GACCGACGTG GACGGCAACA CCTACCTCGA TTACGTTGGT
TCATGGGGAC CATTCATTCT CGGCAGTATG CACCCACGCA TTACCGCTGC ACTTGAGTAC
ACGCTAAAAA ATATTGGCAC CAGCTTTGGT ACACCAATTG AGATGGAAAT TGAAATTGCT
GAACTGCTTT GCCAAATTGT GCCTTCACTT GAAATGGTGC GTATGGTAAA CAGCGGCACC
GAAGCCACCA TGTCAGCCGT GCGCCTTGCA CGCGGTTACA CCGGTCGCGA TAAAATCATC
AAATTTGAAG GTTGCTACCA CGGGCACGGC GATAGCTTCC TCATTAAAGC AGGTTCAGGC
GCTCTTACGC TTGGTGCTCC CGATAGCCCT GGCGTTACCA AAGGCACAGC TCAGGACACT
CTGAACGCAA CCTATAACGA CATCGAATCA GTAAAGTTGC TTGTTCAAGA GAACAAAGGC
AACGTTGCTG CAATTATTAT TGAACCTGTT GCTGGTAACA CCGGTGTTAT TCCAGCCCAA
CCCGGATTCC TTGCTGCACT CCGTCAGCTT TGCGACGAAG AAGGCATTGT GCTGATTTTT
GACGAAGTGA TGTGCGGCTT CCGCGTAGCA CTTGGCGGCG CACAAAGCCT TTATGGCGTT
ACCCCCGACC TTACCACAAT GGGCAAAATT ATTGGCGGTG GTCTGCCTGT TGGTGCATTT
GGCGGCAAAC GCAAGCTCAT GGAGCGCGTT GCACCACTTG GCGACGTTTA CCAAGCTGGT
ACGCTTTCAG GTAACCCGCT GGCACTGACC GCTGGTCTTG AAACCTTGAA AATTCTCATG
GATGAGAATC CATATCCAGA GCTTGAAAGA AAAGCTGTTA TTCTTGAAGA GGGCTTTAAA
GCAAACCTTG CAAAACTTGG CTTGAACTAT GTTCAGAACC GTGTTGGTTC CATGTCGTGC
CTCTTCTTTA CCGAAACGCC TGTTGTGAAC TACACAACCG CTATTACGGC TGATACCAAG
AAGCACGCCA AATACTTCCA CTCATTGCTC GATCAAGGCA TTTACACGGC TCCATCGCAG
TTTGAAGCAA TGTTCATCAG CTCAGTAATG ACCGACGAAG ATTTGGATAA AACCATCAAA
GCAAACTACA ACGCTTTGGT TGCTTCACAG CAATAA
 
Protein sequence
MPQITRSIEL FEKAKKFIPG GVNSPVRAFK SVGGTPIYMA KGSGAYMTDV DGNTYLDYVG 
SWGPFILGSM HPRITAALEY TLKNIGTSFG TPIEMEIEIA ELLCQIVPSL EMVRMVNSGT
EATMSAVRLA RGYTGRDKII KFEGCYHGHG DSFLIKAGSG ALTLGAPDSP GVTKGTAQDT
LNATYNDIES VKLLVQENKG NVAAIIIEPV AGNTGVIPAQ PGFLAALRQL CDEEGIVLIF
DEVMCGFRVA LGGAQSLYGV TPDLTTMGKI IGGGLPVGAF GGKRKLMERV APLGDVYQAG
TLSGNPLALT AGLETLKILM DENPYPELER KAVILEEGFK ANLAKLGLNY VQNRVGSMSC
LFFTETPVVN YTTAITADTK KHAKYFHSLL DQGIYTAPSQ FEAMFISSVM TDEDLDKTIK
ANYNALVASQ Q