Gene Mlg_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1661 
Symbol 
ID4270262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1900257 
End bp1902203 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content67% 
IMG OID638126419 
Productputative glutamate synthase (NADPH) small subunit 
Protein accessionYP_742497 
Protein GI114320814 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.950377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCA CCACCAACGA GATGCAGTCG CTGACCCTGC GCCGATTCAA GGAGGGCGAC 
CATCAGCCCA AGGACTGGCA GGAGCAGATC TTCCAGGCCG GCTGGTCGCA CAAGTGCCCC
ACCTATGTGC ACCGGACACC GCCGTGCCAG GGCAGTTGCC CGGCGGGCGA GGATATCCGG
GGCTGGTTGC AGATTGCCCG CGGACTGGAC AAGCCGACGG CGGACGAACC CTGGCAGGCC
TATGCCTTCC GTCGCCTCAC CGAGGCGAAC CCCTTTCCTG CGGTAATGGG CCGGGTCTGC
CCCGCCCCCT GCGAACAGGG GTGCAACCGT AACGCGGTGG AGGATCACGT TGGCATCAAC
GCCGTGGAAC ATAAGATTGG CGACTGGGCG CGGGAAAATG ATCTGAAATT CGATGCCCCG
GGCGAGCCCA CCGGCCGCCA CGTGGCGATC ATCGGCAGCG GCCCGGCCGG TATGGCCGCG
GCTTACCAAC TGCGCAAACG GGGCCATGCC TGCACCCTGT TCGAGGCCCA GGAGGAACTG
GGCGGCATGA TGCGCTACGG CATCCCGGGC TACCGCGTCC CGCGGCAGGT TCTCGATGCC
GAGATCCAGC GCATCCTCGA CCTGGGCGTG GAGGTCCGCA CCGGGGTCTG GGTCGGCCGG
GACATCACCA TCGAACAGCT CGACAACGAC TACGACGCCG TTCTCTGGGC GGTCGGCACC
CACAAAGGGC GCGACCTGCC GGTGGAGGGC TTCGAGGCGG CGCCCAACTG CCTCACCGGT
GTGGACTTCC TGCGGGCCTT CAACGAGGGC CGGCTGCACG CGGTGAGCGA CCGGGTCATC
GTGATCGGCG GCGGTGACAC CTCCATCGAC GTGGCCTCGG TCGCCCGCCG GCTTGGCTAC
AGCTCGGAGC TCGGCGACAA CCAGGGCGTG GAGCACGTGG TGATGGGCTA TACCGCCCAC
GATGCCGCCA GCCTGGCGGT GCGGGAAGGG GCCAAGGTCA CCCTCACCTC CCTGTTCCCG
CGCGAGGAGA TGACTGCCAC CGACCAAGAG GTGGAGGACG CCCTGCGCGA GGGGGTGGAC
ATCAAGGCCG GCGTCATGCC GGTGGCCGTA ATCACCGATG ACGAGGGCAG GGCCACCGCC
GTGCGCTTCG CCGAATGCCG GATGGAGAAA AACCGCCCCG TCCCCCTGGA AGGCACGGAG
TTCGAGGTCG AGACCGACCT GGTGATCTCG GCCATCGGCC AGATGGGCAA CCTGGAGGGG
CTGGAGGCGC TGGACAACGG CAACGGCTTC ATGGACTGCG ACCCCCACTT CCAGGTCAAG
GGTCGACCGG GGCACTTCGT GGCCGGGGAC ATCATCCGCC CGCACCTGCT GACCACCGCC
ATCGGCCAGG CCCGCAGCGC GGTCGCCAGC ATGGATCACT ACTTCCAGAC CGGCGAACCC
GCCAAGTTCC CCAAGATCAA CGTCCTGCAC TTCAACCTGC TGCAGGCGAT GCGCAAGGCG
GGCCAGGAGC CGACGCCCTA CGAGCCGCAG CCGGTGCGCG GCACCGCCGA CTCGGCTTTC
GCCGTCCACA ACTACGAGGA CCGCTCCAAG GTCGAGATCA TCAAACACGA CCAGCTCTTC
CTCGGCCATT TCAAGCCGAC ACCACGCCAC CAGCGCCAGC ATCGCGAGAT CAGTGAAGAC
TCGGTGATCG GTGATTTCGA TGAGCGGCTC CATCCGTTGT CCGATGAGGA GGCCGTCGCT
GAGGCCGAGC GCTGCATGAG TTGCGGCCTC TGCTTCGAGT GCGACAACTG CCTGATCTAC
TGCCCCCAGG ACGCGGTCGA GCGGGTGCCG AAAAAGGAAC GCGCGACCGG TCGCTACGTG
CAGACCGATT ACACCCGCTG CATCGGCTGT CATATCTGCC GCGATGTCTG CCCCACCGGT
TACATCGAGA TGGGGCTGGG GGAATAA
 
Protein sequence
MSTTTNEMQS LTLRRFKEGD HQPKDWQEQI FQAGWSHKCP TYVHRTPPCQ GSCPAGEDIR 
GWLQIARGLD KPTADEPWQA YAFRRLTEAN PFPAVMGRVC PAPCEQGCNR NAVEDHVGIN
AVEHKIGDWA RENDLKFDAP GEPTGRHVAI IGSGPAGMAA AYQLRKRGHA CTLFEAQEEL
GGMMRYGIPG YRVPRQVLDA EIQRILDLGV EVRTGVWVGR DITIEQLDND YDAVLWAVGT
HKGRDLPVEG FEAAPNCLTG VDFLRAFNEG RLHAVSDRVI VIGGGDTSID VASVARRLGY
SSELGDNQGV EHVVMGYTAH DAASLAVREG AKVTLTSLFP REEMTATDQE VEDALREGVD
IKAGVMPVAV ITDDEGRATA VRFAECRMEK NRPVPLEGTE FEVETDLVIS AIGQMGNLEG
LEALDNGNGF MDCDPHFQVK GRPGHFVAGD IIRPHLLTTA IGQARSAVAS MDHYFQTGEP
AKFPKINVLH FNLLQAMRKA GQEPTPYEPQ PVRGTADSAF AVHNYEDRSK VEIIKHDQLF
LGHFKPTPRH QRQHREISED SVIGDFDERL HPLSDEEAVA EAERCMSCGL CFECDNCLIY
CPQDAVERVP KKERATGRYV QTDYTRCIGC HICRDVCPTG YIEMGLGE