Gene Mlg_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1954 
Symbol 
ID4268123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2223384 
End bp2224601 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content64% 
IMG OID638126709 
Producthypothetical protein 
Protein accessionYP_742786 
Protein GI114321103 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0562969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.072351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGTA AACTGACCTG CTATTGCAGC GCGGCGGCGC TGGCTGCCGG TGCGTCGCTC 
TCCGGCGCGG TGATGGCCGA CGAGGCCCGG ATTGCCGAAC TCGAGGAGCG TATCGAGGCG
CTGGAGGCGA GTCCGGCCAC CGGCGACGGC ATCCGGTTCG GGGGCGCGCT GCGCTTCAAC
GTCCGTTACG ACGACACCGA CGCCGGCTCC GCCATCCGCG ACCGCGGCGG CGATATCAAC
CTGGACACCT TCCGCGTCAA CGTGGACGGC CGCCAGGATC GGGTGACCTT TGCCGCCGAG
TATCGCTGGT ATCCGGACTT CGACCAGCAC TTCCTGCACA CCGGCTGGGT GGGCTACGAC
TTCACCGACA CCACCACGGC CCGGATTGGT CAGCAGCGTG CCGCCTTCGG CCTGCAGCCC
TACCAATCCA ACAACTTCTG GTTCAGCAGC AACTACTATG TCGGGCTGGA GGACAAGCTA
GCCATCGGCA TCAACGTAGA CCATGAGCAG GGCCCGCTGA AGCTGGACCT GGGCTTCTTC
AGCAACCCGG CCTCCGGCAG CGCCGGCAGC TCCGGGCACT ACTCCACCGA GGTGGCCCCG
GCCGCTGACT GTGGCGCCGG TGCCGACGCC GGGTTGTGCA ACGAGGAGAT GAACCAACTC
TACGCCCGCG CCGCCTACAC CTTCGACCAT GGACCGGACG CCGCCACTGA GTTCGGCATC
TCCGGCATGG CCGGCAAACT GCGCAACACC CTGACCGGTG ACCGCGGTGA CAGCTGGGCG
GCGGCCGCGC ACCTCAACGG CCAGTATCAG CGCTGGAATG TCATGGCCCA GTTTGCCTCT
TACGAGCATG ATCCGCGCAA CCCGGACGGG GCCAACGACG ACATCATCAA CATGAGCATC
CAGGGCTTTA CCGGCTTTGG GACCCCGTCA GAGGCGGACA CCTTTATCCT GAATGTCGCC
TACGACCTGC CGGTCAGCTT CGGCCCGGTG AGCAACCTGC GCTTCTACAA CGACTACTCC
ACGGTGCGGA GCAAGTCCGA CAGCTCCCGC AACACGGAGC AGAACGTCAC CGGGATGTCC
ATCACCGCGG GTAACATCTT CACCTACGTG GACATCATCC GTGGCAAGAA CCAGCCCTTT
GTCGGCGGCC AGACCATGGT CGGTGATGAC GGCAGCTGGG AGACCCTGTA CAACATCAAC
ATCGGCTACT ATTTCTGA
 
Protein sequence
MMRKLTCYCS AAALAAGASL SGAVMADEAR IAELEERIEA LEASPATGDG IRFGGALRFN 
VRYDDTDAGS AIRDRGGDIN LDTFRVNVDG RQDRVTFAAE YRWYPDFDQH FLHTGWVGYD
FTDTTTARIG QQRAAFGLQP YQSNNFWFSS NYYVGLEDKL AIGINVDHEQ GPLKLDLGFF
SNPASGSAGS SGHYSTEVAP AADCGAGADA GLCNEEMNQL YARAAYTFDH GPDAATEFGI
SGMAGKLRNT LTGDRGDSWA AAAHLNGQYQ RWNVMAQFAS YEHDPRNPDG ANDDIINMSI
QGFTGFGTPS EADTFILNVA YDLPVSFGPV SNLRFYNDYS TVRSKSDSSR NTEQNVTGMS
ITAGNIFTYV DIIRGKNQPF VGGQTMVGDD GSWETLYNIN IGYYF