Gene Clim_2002 details


Gene Information

Locus tag: Clim_2002
Symbol:
ID: 6355506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2221919
End bp: 2223361
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 54%
IMG OID: 642669600
Product: putative oxidoreductase
Protein accession: YP_001944013
Protein GI: 189347484
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
TIGRFAM ID: [TIGR01316] glutamate synthase (NADPH), homotetrameric
            [TIGR01318] glutamate synthase small subunit family protein, proteobacterial
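
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that does not contribute to the protein length; both assumptions agree with the numbers listed here, but they are not stated by the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Clim_2002.
length = gene_length(2221919, 2223361)
residues = protein_length(length)
print(length, residues)  # 1443 480
```

Both results match the Gene Length (1443 bp) and Protein Length (480 aa) fields.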


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.14117
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTTAA AACCTATGAC GACCAAGGAG CGCCTTGCCA TACCGCGCCA GACCATGCCT 
GCACAGGATC CTGCCGAGCG GGTTCGGAAC TTCAATGAGG TCAACCTCGG CTATACACCC
GAGATAGCAC AGAAAGAGGC CATGCGCTGC ATTCAGTGCA AGGACCCCAT ATGCATCAAG
GGCTGTCCCG TAAATATCAA AATAGACAAG TTCATCAAAC TGATCGCGGA GGGAAATTTT
CTCGGAGCCG CAAAAAAAAT CAAGGAGGAC AACATTCTTC CGGCAATCTG CGGCAGGGTG
TGCCCCCAGG AAGATCAGTG CGAAAAAGTA TGCGTTCTGA CCAAAAAGTA TACCCCGGTG
GCTATCGGAA ACCTTGAACG GTTTGCCGCC GATTACGAAC GTGAACATGG ACAGATCGAG
CTGCCTGAAG TGCAGGCGTC GACAGGGAAA AAAATAGCCG TTATCGGCAG CGGGCCGGCC
GGCCTGAGCT GTGCGAACGA TCTGGCCCGT TTCGGCCACT CCGTTACGGT TTTCGAGGCA
CTGCATGAAC TCGGAGGCGT TCTGATGTAT GGAATTCCGG AATTCCGGCT TCCGAAGGAT
ATTGTAAAGA TCGAAATCGA GGGACTGAAA AAGATCGGGG TAGAGTTTGT CGCCAATACC
GTTGTCGGAC GATCGGTTAC CATCGATGAA CTGATGAATG ATGAGCATTT CGATGCCGTC
TTTATCGGAG TTGGAGCCGG TCTGCCGTGG TTTATGGGCA TTCCCGGCGA AAACCTTCTC
GGGGTTTATG CCGCCAACGA GTTTCTCACG AGGGTAAACC TGATGCAGTC CTATACCTTC
CCTGAGAACG ACACCCCGGT TTTCGATTGC AAAGGGAAAA ATATCGCCGT CTTCGGTGGC
GGCAATACCG CTATGGATGC CGTTCGTACC GCGAAAAGAC TTGGAGCGGA ACATGCCTAT
ATCGTCTATC GTCGTTCTGA AGCCGAAATG CCGGCCCGCG CCGAAGAGAT TCATAATGCG
AAAGAGGAAG GCATCGAATT TCTGCTGCTG ATGAACCCGG TCGAATTTAT CGGTAACGAG
CAGCAGTGGC TGACCGGAGT GAAGTGCCTC CGGATGGAGC TGGGAGAGCC GGACGATTCC
GGTCGCCGTA AACCGGTACC GGTACAAGGT TCGGAATTCG TGCTGCCTGT CGATATGGCG
GTAATCTCCA TAGGAAACGG CTCGAATCCG CTCATCAAGC AGACAACTCC CGAAATCGAA
GTCAGCAGAA GGGACACCAT CGTCGTCGAT ATCAACACCA TGGCAACCTC CAAAGAGAAC
GTGTATGCCG GAGGCGATAT CGTCACAGGC GGCGCTACCG TCATTCTGGC CATGGGGGCG
GGACGTACGG CTGCGAAAGC CATTCATGAA CAGCTCTCGG AGCAGGCATC GAATCCCCTA
TGA
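
The gene sequence is displayed in blocks of ten bases separated by spaces, so it needs to be cleaned before any computation such as the GC content reported in the record. The following is a minimal sketch of that step, not the database's own method; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed line of the sequence is used as input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence only; paste the full
# sequence here to compare against the GC content field.
first_line = "ATGGACTTAA AACCTATGAC GACCAAGGAG CGCCTTGCCA TACCGCGCCA GACCATGCCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```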
 
Protein sequence
MDLKPMTTKE RLAIPRQTMP AQDPAERVRN FNEVNLGYTP EIAQKEAMRC IQCKDPICIK 
GCPVNIKIDK FIKLIAEGNF LGAAKKIKED NILPAICGRV CPQEDQCEKV CVLTKKYTPV
AIGNLERFAA DYEREHGQIE LPEVQASTGK KIAVIGSGPA GLSCANDLAR FGHSVTVFEA
LHELGGVLMY GIPEFRLPKD IVKIEIEGLK KIGVEFVANT VVGRSVTIDE LMNDEHFDAV
FIGVGAGLPW FMGIPGENLL GVYAANEFLT RVNLMQSYTF PENDTPVFDC KGKNIAVFGG
GNTAMDAVRT AKRLGAEHAY IVYRRSEAEM PARAEEIHNA KEEGIEFLLL MNPVEFIGNE
QQWLTGVKCL RMELGEPDDS GRRKPVPVQG SEFVLPVDMA VISIGNGSNP LIKQTTPEIE
VSRRDTIVVD INTMATSKEN VYAGGDIVTG GATVILAMGA GRTAAKAIHE QLSEQASNPL
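
The protein sequence corresponds to the gene sequence read codon by codon. As a hedged illustration, the sketch below builds a codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for internal codons; the special handling of alternative start codons in table 11 is ignored here, which does not matter for this gene because it starts with ATG. The `translate` helper is hypothetical, and only the first three displayed blocks of the gene sequence are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace in the input (block separators, line breaks) is ignored.
    Alternative start codons are not treated specially.
    """
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as displayed in the record.
first_codons = "ATGGACTTAA AACCTATGAC GACCAAGGAG"
print(translate(first_codons))  # MDLKPMTTKE
```

The output matches the first block of the protein sequence shown in the record.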