Gene Mlg_1274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1274 
Symbol 
ID4268937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1477051 
End bp1478430 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content74% 
IMG OID638126024 
Producthypothetical protein 
Protein accessionYP_742113 
Protein GI114320430 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.373171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCA CCGATCTGAT CAAGCGGCTG CTGCCCACCT CCGGCAAGGC AACCGACAAG 
GCGCCGGAGC CATCGGCCCG GTCGGCCCCC CACGATTCCG CAGGCCAGGC CCGGCTGATC
GCCATCGCCC GGAGCGATGC CGAACCCGGC GCCCGGGCCG ACGCGGTCAA GCGGCTGACC
GATCTCGACA CCCTCCAGGC CTGCCTGGCG CCGGCCACAC CGGCGCCCGT CCGGATGGCG
GCGGTGGCAC GCTTGAGCCT GCTGCTGAAA TCGGACGACC CGGGCCTCGC CCCGCAGGAG
CGGGTGGCGG CTGTCCGCCA TTGCCCCGAC ACCACCGTCC TCGCCCACCT GGCACAGTCG
GCCCGGCTCG AGGCCGTGCG ACGGGCGGCA CTGGACCGCC TGCGCACCCC CGCGGCCTGT
CTCCAGGCCG CGCTGCACGA CCCGGTCCGA CGGCAACGCA AGTTTGCCGT TGAGTGCGTC
GACCACCTGG AGACCCTGGA GGCGATTGCT GCCCAGAGCG ACGACCGCGG CGTGGCCCGT
CTGGCCCGGC GCCGCCTGCA GGCCCTCCGC GACGAGCAGG CCGAGCAGCA GGCGGTCCAG
ACCCAGGCGG TGGGGCTGTG CGAGGCCATG GAGGCGCTGG CAAGCGCGCC TTGGCGCGAT
GACCTGCCGG CCCGTCGCCA GCGCCTGGAG AACCAATGGC GGCAACTTGA CCCGACGCCA
CCCCCGGCGC TGGCGGACCG CTTTGCCCGG GCCCAGGGCC ATTGCGCGGC CCGCAGGGCA
CCCCAGGCCG GCGACCGGGA AGCGCGGCTG CTGAACGCCC TGGAGGAGGA GGCGCGGGCG
CTCACCCACC ACCCGGAGCC GGAAGAGGCG CGGCTCCGCC AGCTCCGCGA GTCGCTGGCG
CGCACCCGCC GGGAATGGCT GCACCTGGGT GCGGACCCGG CCACCGAGGC CCGCTTCCGC
ACCCGCTACT GGCGTCTGGA ATGCTGGTGT GCCGACGCCC GGCGGTTGCT CGACCAGCAA
CGGATCATCG AGCAATTGCT CAGTGAGGCC GACGCCCTGC CCCTCACCGA GGCGCCCCCC
CTTCTGCGCC ACGCCCGGCA GTTGCAGCAG GCGTTGCGAC AGGCCCCGTG GCACAGCGGA
TTCCCGCTGC CGCGTCTGCT CAGGGAGGGC CAGGCGACGG TCAAGGCGCT GGAGCGCACG
GCCCGCCATG CCGGCCAGAC CCGCGTGAAA CGGCTGCAGG CCCTCCACCA CCTGATGGCC
AGCCTGGAAC AGGCCATCGA AGAGCGCGCC TGGGGGCGGG CCCGCCGACT AATCAGCGAG
GCCCTGCGCG AGACCGGCGA GCGGCCCGCC GGCCCCGCTG GGGATCAGAG GAAGAGGTAG
 
Protein sequence
MSPTDLIKRL LPTSGKATDK APEPSARSAP HDSAGQARLI AIARSDAEPG ARADAVKRLT 
DLDTLQACLA PATPAPVRMA AVARLSLLLK SDDPGLAPQE RVAAVRHCPD TTVLAHLAQS
ARLEAVRRAA LDRLRTPAAC LQAALHDPVR RQRKFAVECV DHLETLEAIA AQSDDRGVAR
LARRRLQALR DEQAEQQAVQ TQAVGLCEAM EALASAPWRD DLPARRQRLE NQWRQLDPTP
PPALADRFAR AQGHCAARRA PQAGDREARL LNALEEEARA LTHHPEPEEA RLRQLRESLA
RTRREWLHLG ADPATEARFR TRYWRLECWC ADARRLLDQQ RIIEQLLSEA DALPLTEAPP
LLRHARQLQQ ALRQAPWHSG FPLPRLLREG QATVKALERT ARHAGQTRVK RLQALHHLMA
SLEQAIEERA WGRARRLISE ALRETGERPA GPAGDQRKR