Gene Mlg_1239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1239 
Symbol 
ID4269023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1443650 
End bp1444963 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content71% 
IMG OID638125989 
ProductFolC bifunctional protein 
Protein accessionYP_742078 
Protein GI114320395 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.168236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.703163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACA CCCCGGGATC GGCCGCCCCA ATGCCCCAGC GCGACCGTTG GCGGCTGGAG 
GACTGGCTGC GGTGGCAGGA AGGGCTCAGC CCGGTGGAGA TCAACCCCGG GCTGGAGCGG
GTGCAGGCGG TCGGCGAACG GCTGGGTGCC CTCACGCCCC GTTGCCCGGT CATTACCGTG
GCCGGTACCA ACGGCAAGGG CTCCTGCATC GCCTACCTGG AGGCAATGCT CGGTGCCGCC
GGATACCGGA CCGCTGCCTA CACCTCGCCG CATCTGTTGC GCTATAACGA ACGCATCCGG
CTGGCCGGGG TCCCGGTGAG TGATGAGGCC ATCACGGCGG CCTTCTCCCG CGTGGAACAG
GCCCGGCAGG GCACCCCGTT GACGTATTTC GAGTACGGCA CGCTGGCCGC TCTGAGCCTG
TTCAGCGAGG CGGAGGCCGA GGTCTGGCTC CTCGAGGTGG GCATGGGCGG GCGCCTGGAC
GCGGTGAACG CGGTCGATCC CGATTTGTCC ATAATCACGA GTATTGGCCT CGATCACACC
GAGTGGCTGG GCGCGGATCG CGAGCGGATT GGCGCGGAGA AGGCCGGTAT CATGCGTCCG
GGACGGCCCG TCTGCCTGGG CCAGGCGGAC CTCCCCGACA GTGTGTCCGA TCGGGCCCGG
ACGCTGCGGG CGCCGGTGAC CGCGGCCGGT CGCGACTTCC ATTGGCGGCG ACAAGCCCTG
GGCTGGGACT GGCTCAGCGG CGACGAGCGA CTGGACGGCC TGCCCTGGCC CGGGCTGACC
GGGACGGTGC AACTGGATAA CGCGGCGGTG GTCATCGCCG GTCTGAGGCG GCTGCGGGAG
CGGCTCCCGG TGGATCGCGC CGCGCTCGAG CGGGGACTGC GCAGCGCCCG CCTGCCCGGA
CACATGGAGC GGGTCCGGCG CCGGGGCGTG GAGTGGTTGT TCGACGTGGC CCACAATGAG
GACAGCGTGC GCGTATTGGC CGAGACGGTC CGGGACGAGG CGGGCAAGGG GCGCGTCATC
GGGCTCTTTG CCGCCATGCA CCGCAAGGCC CTGTCCGGTG TGCTTGCCAC CATGGGTGCA
GTGGTGGACG AGTGGTATCT GCCACGGTTG GAGGATCCCC AGGCGCATCC GCCGGAGGCG
GTGGCGGCGG GCCTACGCGA GACTGGGGTG GATGCCTCCG TTATCCATAC CGGCGGCCTG
TCGGCCCTGC TTGCCGCGGT AGCGGACCGC GCCCGCCCCG GGGACCGGGT GGTGGTGTTC
GGCTCGTTCC GTACCGTCGA GGCGGTGATG CGGGCCGGAG GGCGCGTAGA CTGA
 
Protein sequence
MPDTPGSAAP MPQRDRWRLE DWLRWQEGLS PVEINPGLER VQAVGERLGA LTPRCPVITV 
AGTNGKGSCI AYLEAMLGAA GYRTAAYTSP HLLRYNERIR LAGVPVSDEA ITAAFSRVEQ
ARQGTPLTYF EYGTLAALSL FSEAEAEVWL LEVGMGGRLD AVNAVDPDLS IITSIGLDHT
EWLGADRERI GAEKAGIMRP GRPVCLGQAD LPDSVSDRAR TLRAPVTAAG RDFHWRRQAL
GWDWLSGDER LDGLPWPGLT GTVQLDNAAV VIAGLRRLRE RLPVDRAALE RGLRSARLPG
HMERVRRRGV EWLFDVAHNE DSVRVLAETV RDEAGKGRVI GLFAAMHRKA LSGVLATMGA
VVDEWYLPRL EDPQAHPPEA VAAGLRETGV DASVIHTGGL SALLAAVADR ARPGDRVVVF
GSFRTVEAVM RAGGRVD