Gene Mlg_1475 details


Gene Information

Locus tag: Mlg_1475
Symbol:
ID: 4269267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1685772
End bp: 1686995
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 62%
IMG OID: 638126231
Product: hypothetical protein
Protein accession: YP_742314
Protein GI: 114320631
COG category:
COG ID:
TIGRFAM ID:
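
The two length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(1685772, 1686995)
print(length)                  # 1224
print(protein_length(length))  # 407
```

Both values agree with the Gene Length and Protein Length fields listed above.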


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.843921
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAGA ACCCCGACAA AGGTTTTGTA GCCCTGCTGG GCTGGAGCCT GAACGCCGTT 
GAAGCCGCTG AGAACTTCGA CCGGCGCTAC GTGGTGGTGG CCCCGGACTG GGCCGAGGAT
TACTGTCAAC AGCACAATAT CCCCTATGTG CCCTGGAACT TCGAGCGCCT CAACGACCGC
TCAATGGAGA TCGCCCAAAC CCTCAAGGAC ATGGGCGTGG ATGTCGCCAT CCCGCTGTTC
GAAGAGACCG TGGAGTGGGC GGGGGCCATC AATGCGGTGT TGTTGGACAG CCCCAAGCTG
CTGGGTCAAT CGATGCTGCT GCGCGACAAG TCGCTGATGA AACGCCGCGC GCAGCTCGGC
GGTATCCGGG TGGGCATCTT CGAGGAGGCC CATGACAAGG ATGACGTCAT CCGTTTTCTC
AAGCGGGTCA ACCAGACGCT GCTAAAGCTG GACGGCGATC CCAACGACCC CATCCACTTC
AAGGCGTTCG ACAAGGCCGG CTGCCTCGGC CATCGGGTCA TCCGCACCCC GGATGACGTC
GATACCATCC CCGAGGAGGA GTTTCCGGCG CTGATGGAGT CCCATCTGGA CGGCTGGGAG
TTCGCCGTGG AGGCGTGGGT CCAGAACGGC AAAATCCGCT TCATGAATAT CTCCGAGTAC
GTCACCCTGG GCTACTCGGT GTTCGTGCCC GCCTCGCCGG AACTCGAGGC CCACCGGCCC
GCCATCCGAA GGGAGCTGGA AAAGCTCATC AAGGCCTTCG ACATCGAATT TGGCTTCGTC
CACCCGGAGT ACTTCGTCAC CAACGACAAC GAGATGTATT TTGGCGAAGT GGCCTACCGG
CCGCCGGGCT TCAAGGTCTT CGAGCTGCTG GAGCGGGTCT ACGGCTTCAA CGCCTACCAG
GGGCTGATCC TCTGCTTCGA CCCCAAGACC ACCGAGGAGG AGATCACGGC CTTCTTCCCG
CGCGAGGTGG TGGATGCGGA TGGGCATGCC GGCTGTTTCG GCGTCTATCC GCGGCGCCGT
GTGGTCAGCC GCCTGGAGAT CCCGGAGGAG ACCGAGAATC ACCCCTACTT TGAGTCCCAC
GAGCTAACCC CGCCACTGGA GGAGACGGTG ACCAAGCGCA CCGCCTTCGG TACCCACTGG
GGCCTGATTT ACTTCCGCGG CGAGGACCCC CACGTGCTTC GGGATCTGCT GAAACACCAG
GAGGATCTGG ACTTCTACGT CTAG
 
Protein sequence
MEKNPDKGFV ALLGWSLNAV EAAENFDRRY VVVAPDWAED YCQQHNIPYV PWNFERLNDR 
SMEIAQTLKD MGVDVAIPLF EETVEWAGAI NAVLLDSPKL LGQSMLLRDK SLMKRRAQLG
GIRVGIFEEA HDKDDVIRFL KRVNQTLLKL DGDPNDPIHF KAFDKAGCLG HRVIRTPDDV
DTIPEEEFPA LMESHLDGWE FAVEAWVQNG KIRFMNISEY VTLGYSVFVP ASPELEAHRP
AIRRELEKLI KAFDIEFGFV HPEYFVTNDN EMYFGEVAYR PPGFKVFELL ERVYGFNAYQ
GLILCFDPKT TEEEITAFFP REVVDADGHA GCFGVYPRRR VVSRLEIPEE TENHPYFESH
ELTPPLEETV TKRTAFGTHW GLIYFRGEDP HVLRDLLKHQ EDLDFYV
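
The protein sequence is the translation of the gene sequence. A minimal sketch of that step follows, assuming the sequences are pasted in the 10-character blocks shown above. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons), so a standard codon table is used here. The helper names `clean`, `translate`, and `gc_fraction` are illustrative and not part of any library.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGGAAAAGA ACCCCGACAA AGGTTTTGTA GCCCTGCTGG GCTGGAGCCT GAACGCCGTT"
print(translate(first_line))  # MEKNPDKGFVALLGWSLNAV
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows comparison with the protein sequence and the reported GC content.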