Gene Mlg_0907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0907 
Symbol 
ID4269292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1025247 
End bp1026548 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content65% 
IMG OID638125659 
Producthypothetical protein 
Protein accessionYP_741751 
Protein GI114320068 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.68427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGCGC ACAAGAACAT CTTCGTATTC GGGCTGGACG AGGAGAACCA CGCGTTTCTC 
AAGCGCGTCC GGCACGCGGA GCACTATACG TTCCACGGCC TGCTGAACCG CTCCGAACTG
GTGGAGCGGG ACGATTACGA CATCCCGCAC CTCATCGAGA AGGTGCGGAC CCGGCTGAAT
GGCTTCGATG GGTCCGTGGA CGGCCTGATC CACTACATCG ATTTTCCGGT GAGCACCACG
GTACCCCTGC TGGCCCGGGA GTACGGGCTG CCTTCGGCGT CCCTGGAGGC GGTGCTCTGC
TGCGAGCACA AGTACTGGGC GCGGGTGGAG CAGGCCAGGG TGATCCCCGA TGCCTGCCCG
CCCTTTGCCG CCTTCGACCC CTTCGACGAC CGTGCCCGGG CGCGGCTCGA GGCGGAGATC
GGCTACCCGT TCTGGGTCAA GCCCATCAAG TCCTTCTCCA GCTACCTGGG TTTCCGTATT
GACGGACCGG AGGACTTCGA GCACGCCCAG GCGCGGATGC GGGCGGGAAT CGGTCGCTTC
GCCGAGCCCT TCGACTACCT GCTGGACCAG GTGGAACTCC CCCCGGAGGT GCGCGGCATC
GGCGGCGGCC ACTGCCTGGC CGAGGGCATC ATTGGTGGCC ATCAGTGCAC CCTGGAGGGC
TATGGCTACC AGGGCCATGT GTATGTCTAC GGTGCGGTGG ACTCGGTCCG CGAGCCCAAC
GGCTCTTCAT TCCGGCGCTA CCAATACCCG TCGGTGCTGC CCGAGCCGGT CCAGCAACGG
ATGATCGAGC AGGCGCGGCG GTTCATGACC CACATCGGCT ACGACAATGC GCCCTTCAAC
ATTGAGTTCT ACTGGGACGA AGCCACAGAC GATGTCTGGC TGTTGGAGGT CAACACCCGG
CTGTCCCAGT CCCACTGCGA CCTGTTCGAG AAGGTGGACG GGGTCTCGCA CCAGGAGGTG
GCCGTTGACC TGGCGCTGGG TCGGGCGCCG GAGTTCCCGC AGGGGCGGGG CGAGTTCCCC
ATGGCGGCGA AGTGCTTCCT GCGGGTGTTC GAGGACGGCA AGGTCACCCG GGTGCCCAGT
GCATCGGAGG TGCGCGCTTT GGAGGAGGCC TTCCCCGGGA CCCGTATCCA GATCCAGGCT
CGCGAGGGTG CCTGGCTGTC CCAGCTCTGG GATCAGGACA GTTACAGTTA CTGCCTGGCG
CTGATCTTCC TGGGCGGCGA GAATGAAGAG GACATAGAGG CCCGTTTCGA ACACATCCGC
GAGGGCCTGG ACTTTCGGAT CGAGAAGCCG GAGGCAGCGT GA
 
Protein sequence
MVAHKNIFVF GLDEENHAFL KRVRHAEHYT FHGLLNRSEL VERDDYDIPH LIEKVRTRLN 
GFDGSVDGLI HYIDFPVSTT VPLLAREYGL PSASLEAVLC CEHKYWARVE QARVIPDACP
PFAAFDPFDD RARARLEAEI GYPFWVKPIK SFSSYLGFRI DGPEDFEHAQ ARMRAGIGRF
AEPFDYLLDQ VELPPEVRGI GGGHCLAEGI IGGHQCTLEG YGYQGHVYVY GAVDSVREPN
GSSFRRYQYP SVLPEPVQQR MIEQARRFMT HIGYDNAPFN IEFYWDEATD DVWLLEVNTR
LSQSHCDLFE KVDGVSHQEV AVDLALGRAP EFPQGRGEFP MAAKCFLRVF EDGKVTRVPS
ASEVRALEEA FPGTRIQIQA REGAWLSQLW DQDSYSYCLA LIFLGGENEE DIEARFEHIR
EGLDFRIEKP EAA