Gene Cag_1954 details


Gene Information

Locus tag: Cag_1954
Symbol:
ID: 3746714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2482256
End bp: 2483995
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 47%
IMG OID: 637774489
Product: putative glutamate synthase (NADPH) small subunit
Protein accession: YP_380245
Protein GI: 78189907
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
TIGRFAM ID:
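
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based, inclusive positions and that the terminal stop codon is not counted in the protein length; neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2482256
end_bp = 2483995

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the listed Gene Length (1740 bp) and Protein Length (579 aa).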


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.151886
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGTAG AATCAAATCC AATCCTTGAT TTTGCAATAA ACTACCAGTT TCCTCCTTTT 
GAGGAGCTTA CTGGTACTCA TAAAATTGTT GCATTTGGGG ATCACAGTCA TAAGTGTCCT
GTCTATGTTC GGCAAACTCC GCCTTGTCAG GCAGAGTGCC CAGCAGGTGA AAATATTCGT
GGCTACCATC GTTTTTTGAA TGGTATTGAT AAATCGGAAG ATGAATGGAA ATCAGCATGG
GAAACCTTAG TTGAAATAAA TCCATTTCCG GCGGTAATGG GGAGAATTTG CCCCCATCCA
TGCCAAAGCG CATGTAACCG TCAATATCAC GACGAAAGTG TGGCTATTAA CGCCGTTGAG
CAAGCAATAG GCAATTACGG CATTCAAGCA GGCTTGCAGC TTCCCGAACC AGCGCCAGCA
ACGGGCAAAC GTGTTGCCGT CATTGGTGGT GGTCCAGCAG GTTTATCGTG CGCCTATCAA
TTACGGCGTC GTGGACATGC TGTTACGCTG TACGATGCTA ATGAAAAGCT TGGTGGAATG
GTGCTTTACG GTATTATGGG CTACCGTGTT GACCGTAAAG TACTTGAAGC TGAAATCCAA
CGCATCATTA ACCTTGGTAT TGAAACCAAG ATGGGTGTTC GTGTTGGTAG CGATGTAACA
CTTGATGAGC TTGAGCAAGA GTTCGATGCC GTTTTTATTG GTATTGGTGC TCAAGCAGGT
CGTTCATTAC CTGTAGCTGG TGCAGCGGAA ACTCAGGGAG TTACCAATGC TATTGAGTTT
CTTAGAAGCT ATGAGGTAGA GGGCGATAAC ATTACCATTG GTAAAAAAGT ACTTGTTATT
GGCGATGGTA ACGTTGCTAT GGACGTTGCT CGCCTTGCTT TGCGTCTTGG TTCAGAAGCG
GCTGTTGTTG CTGGTGTGCC TCGCGAAGAG ATGGCTTGTT TTAAAGAAGA GTTTGACGAT
GCCGATCACG AAGGTGCTGT TATGCACTTT ATGAGCGGAG CGCTTGAACT GCTAAAAAAT
GATGATGGTT CTGTTCGTGG GTTGCGTTGT GCTAAAATGG TGAAAAAAGC CAAAGGCGAA
GAGGGATGGA ATTCACCAAT TCCATTCTTC CGCTATAAAA ACAGCGATGA AACCTTTGAC
ATTGAAGCCG ACACTGTGGT TGCTGCCATT GGGCAAACAA CCAACATGCA AGGCTTTGAA
GCCATTACCA ATGGAGCGCC TTGGTTGAAA GTTGATCGCT CATTCCGCAT TCCGGGACGC
GAAAAACTCT TTGGTGGTGG TGATGCCCTT AAGGTTGATC TTATCACTAC TGCTGTTGGG
CATGGACGTA AAGCCGCAGA GGCTATTGAT GCCTTCTTAA AAGGTGAGCC AATGCCCGAT
CAAGGTTACC GTGAAGTTAC GAAGGTGAGC CGTCAGGATG TTCTTTACTT CCCTGTTACG
CCGCCAGCAA AGCGCGATAC CATTAAAATT CAAGAGGTTG TTGGCAACCA CGATGAATTG
TTAGTTGCCT TAACGCCTGA GCAAGCAAAA GCTGAGTCGG GTCGCTGCAT GAGCTGCGGC
TTGTGCTTTG ATTGTAAGCA GTGTGTTTCG TTCTGCCCGC AAGAAGCAAT TTCTCGCTTC
CGCGATAATC CTGTAGGCGA AGTAGTTTAT ACCAATTACG ATAAGTGTGT TGGTTGCCAC
CTCTGCTCGT TAGTGTGTCC TTCGGGCTAC ATACAAATGG GTATGGGTGA TGGCTTATAG
 
Protein sequence
MKVESNPILD FAINYQFPPF EELTGTHKIV AFGDHSHKCP VYVRQTPPCQ AECPAGENIR 
GYHRFLNGID KSEDEWKSAW ETLVEINPFP AVMGRICPHP CQSACNRQYH DESVAINAVE
QAIGNYGIQA GLQLPEPAPA TGKRVAVIGG GPAGLSCAYQ LRRRGHAVTL YDANEKLGGM
VLYGIMGYRV DRKVLEAEIQ RIINLGIETK MGVRVGSDVT LDELEQEFDA VFIGIGAQAG
RSLPVAGAAE TQGVTNAIEF LRSYEVEGDN ITIGKKVLVI GDGNVAMDVA RLALRLGSEA
AVVAGVPREE MACFKEEFDD ADHEGAVMHF MSGALELLKN DDGSVRGLRC AKMVKKAKGE
EGWNSPIPFF RYKNSDETFD IEADTVVAAI GQTTNMQGFE AITNGAPWLK VDRSFRIPGR
EKLFGGGDAL KVDLITTAVG HGRKAAEAID AFLKGEPMPD QGYREVTKVS RQDVLYFPVT
PPAKRDTIKI QEVVGNHDEL LVALTPEQAK AESGRCMSCG LCFDCKQCVS FCPQEAISRF
RDNPVGEVVY TNYDKCVGCH LCSLVCPSGY IQMGMGDGL
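
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, summarized, and translated. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not part of any tool referenced on this page. The codon table is the standard one, whose amino-acid assignments translation table 11 shares; alternative start codons are not handled here. Only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Standard codon table built in TCAG order; table 11 uses the same
# amino-acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq_text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()

def gc_percent(dna):
    """Return the percentage of G and C bases in a non-empty DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)

def translate(dna):
    """Translate complete codons from position 0; stop codons appear as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First line of the gene sequence shown above.
first_line = "ATGAAGGTAG AATCAAATCC AATCCTTGAT TTTGCAATAA ACTACCAGTT TCCTCCTTTT"
dna = clean(first_line)
print(len(dna), translate(dna))
```

Translating this first line gives the same 20 residues that open the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.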