Gene Mlg_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0235 
Symbol 
ID4270862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp267857 
End bp269518 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content67% 
IMG OID638124959 
Productferredoxin-dependent glutamate synthase 
Protein accessionYP_741080 
Protein GI114319397 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0069] Glutamate synthase domain 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.948942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAGA CGCTATGGCC AGTGCCGGGG CGGTATGTCC CGTACCTGAT CTGTATCGCG 
GCCTTCGTGA TCTCGCTGCT GTGCCTGCAG GTCAGCGCCA CCTGGGGGTG GGGCGTGGCC
CTCTTCGGCG GCCTCTCATT GCTGGGCACC TGGGATCTGC TCCAGCCCCG CCGCACCATC
AGCCGCAACT ACCCGGTCAT CGCTCACTTG CGCTACTGCC TGGAAGGTAT CGGCCCGGAA
ATCCGGCAGT ACTTCATCGA ATCGGACACC GACGAGCGAC CCTTCTCCCG GGAACAGCGC
TCAGTGGTCT ACCAGCGGGC CAAGAACCAA CTGGACAAGC GCCCCTTTGG CTCCCTGCTG
AACCTCTACG GCGACGGTTA CGAATGGGTC AGCCATTCCG TCCAACCGGT AGCCGTTGAT
CCCTCGGCCT ACCGAGTGGA GATCGGCGGT CGCTGCCAAC AACCCTATTC GGCCAGCGTC
TTCAACATCT CTGCCATGAG CTTCGGCGCC CTTTCCGCCA ATGCCATCCT GGCCCTCAAC
AAGGGCGCGC GGCTGGGCGG CTTCTATCAG GACACCGGCG AGGGAGGCAT CTCCCGGTAC
CACCTGGAAC ACGGCGGAGA CCTGGTCTGG GAGATCGGTT CCGGTTATTT CGGCTGCCGG
ACCCCGGATG GCAGCTTCAG CCCCGAACGC TTCGCGGAGA CGGCCGGCCT GGACAGCGTC
CGCATGATTG AGATCAAGCT CTCCCAGGGC GCGAAACCCG GGCATGGCGG CATCCTGCCC
GCCGCCAAGG TGAGTCCGGA AATCGCTGCG GCCCGCGGCG TGCCCGAGGG TGAGGACGTG
ATCTCCCCGC CACGCCATTC CGCCTTTTCC ACGCCCCGGG AGCTGATGCA GTTCATCGGG
CAACTGCGTG AACTCTCCGG CGGCAAGCCG GTGGGCTTCA AGCTGGCCAT CGGCCACCCC
TGGGAGTGGT TCGCCCTGGC CAAGGCCATG CAGGCAAGCG ACGAGCGACC GGATTTCATT
GTCGTGGACG GCGGTGAGGG AGGCACCGGT GCCGCGCCCC TGGAGTCGAT CAACCGACTG
GGCATGCCGC TGGACGAGGC CCTGCTGCTG GTCCACAACA CCCTGGTGGG CACCGGCCTG
CGTGACCACA TCCGCCTGGG GGCCGCCGGC AAACTGACCA GCGGCTTCAA GGTCGCGCGC
ACCCTGGCGC TGGGCGCGGA CTGGTGCAAT GCCGCCCGTG GCTTCATGTT CGCGCTCGGC
TGCATCCAGT CCCTGAGCTG TCATACCGAC CGCTGCCCCA GCGGGGTGGC CACCCAGGAC
CGGCGGCGCA GCCGCGGCTT GCACGTGGGC GACAAGGCGC TGCGAGTACG CAACTTCCAC
GCAGGGACCG TGGAGGCGCT CGGCAGCCTG CTGGCCGCCG CTGGCCTGAG CCACCTCGAC
CAGCTCACAC CCGACCATAT CTACCGGCGC CTGTCCGGCA CCGAGGTCCG GAGCTTCGCG
GAACTCTACC CCTTCGTTGA AAAGAACGCG CTGCTGTCCG GCGCCCCCGC CTACCCGGCA
GTATTCCGTG AGTACTGGCC CAGGGCGTCG CCGGACACCT TTCACCCAGT CACCCCCAAA
CACCACCCCA GTGAGGAGGC CGCCATGAAA GGAACCGCGT GA
 
Protein sequence
MSKTLWPVPG RYVPYLICIA AFVISLLCLQ VSATWGWGVA LFGGLSLLGT WDLLQPRRTI 
SRNYPVIAHL RYCLEGIGPE IRQYFIESDT DERPFSREQR SVVYQRAKNQ LDKRPFGSLL
NLYGDGYEWV SHSVQPVAVD PSAYRVEIGG RCQQPYSASV FNISAMSFGA LSANAILALN
KGARLGGFYQ DTGEGGISRY HLEHGGDLVW EIGSGYFGCR TPDGSFSPER FAETAGLDSV
RMIEIKLSQG AKPGHGGILP AAKVSPEIAA ARGVPEGEDV ISPPRHSAFS TPRELMQFIG
QLRELSGGKP VGFKLAIGHP WEWFALAKAM QASDERPDFI VVDGGEGGTG AAPLESINRL
GMPLDEALLL VHNTLVGTGL RDHIRLGAAG KLTSGFKVAR TLALGADWCN AARGFMFALG
CIQSLSCHTD RCPSGVATQD RRRSRGLHVG DKALRVRNFH AGTVEALGSL LAAAGLSHLD
QLTPDHIYRR LSGTEVRSFA ELYPFVEKNA LLSGAPAYPA VFREYWPRAS PDTFHPVTPK
HHPSEEAAMK GTA