Gene Mlg_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0107 
Symbol 
ID4268194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp116491 
End bp119859 
Gene Length3369 bp 
Protein Length1122 aa 
Translation table11 
GC content71% 
IMG OID638124833 
ProductVanZ family protein 
Protein accessionYP_740954 
Protein GI114319271 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCACC GCGAGCCCTG GCAACGCGGC ACCTGCCTCA CCTGGGCCTG GCTGGCCCTG 
GGCGCCGCGG TCTATCTCAC GTTGCTGCCC TTCGAGTTCC GCGATGATCT CGGCCTGCGT
CAGGCGTGGG AGGTCTATCG GAACATGGAC CTTACCGGTC CCGGCGCCTC GGGCCGCCAG
CAGTTCATGG CCAATGCCCT GATGTTCCTG CCGTTGGGCT TTTTCTGGTC GGCCTGGCTG
GCCTATGGGC GCCGCGACAG GGGCACCACC CTGGCCACCT TCCTGGCCGT CTCCGCCCTC
GGTCTGGCGG TGACGGCCAC GGTGGAGTTC CTTCAGATCT GGCTGCCTTT CCGGCACCCG
GCGGCCGCCG ATATCGCCGG CAACTTCACC GGCGCGGTGC TGGGCTCGCT GGCCTGGTTC
GCCTTGCGCG ATCCGCTGGC CCATTGGTGG CAGACCCTGT CCGGGGGCGG ACCGCGCGCG
CTCCACCTGG CCCTGATCGG GTATTTGGTG GTCTACGTGG TCATCGGGCT GCTGCCCTTC
GACATGGTGC TCAGTGCCGA CGAACTGATG CGCCGGCTAC GGTCCAGCGC CTGGGGCCTG
TGGACCACCC CCGACGCCTG CACGACGGGG CTGCGCTGCT ACGCCTGGCT CAGCCTGGAA
GTGGCCGCGG CGGTACCGGT GGGCCTGCTT CTCGCCGTCT GGCTGCAGCG GCGCCGGTTC
CCAACTCTGC TGGCCGGGCT GCCGCTGGCG GTGGTGCTGG CCATCGGGCT GGAGGCCCTC
AACCTATTGA CTCTCAGCGG GATCACCGAG GGCCGGTCTG TGCTGCTGCG GAGCGCCGGC
ATAGCGGCGG GTCTAGTCCT GCTGCACCGG CTACCGGCCC ACTCCCATGG GGTGCTCCGC
TTCCTGCAGC AGCATGGGAG GCTGATCCTC GGCCTGACGC TCCCCGTCTA CCTGCTGTTG
CTGCTGGGCC TGAACCACGG CTTCGGCCCT TACCACACCG ACCTGGAGCG GGCCTGGCGG
CAGTGGGGCG AGCTGCGGCT GCTGCCCTTT TACTACCACT ACCACGTGCC GGAGATCGTG
GCCCTGCGCA GCACCGCGTT GCACCTGGCC ATGTACGCAC CCGTCGGCCT GATGGCCTGG
CTCTGGCACG TCGGGCGTCC CGGCGGCCAG GCCCGGCTCT ACGGGCTCGC CGCCGCCAGC
GGGGCACTGG CGGCGTTGCT GATGGAGGCC GGCAAGCTGT TCGTTGCCGA CGCCCGGCCC
GATCCCGCCA GCCTGGTGCT CGGTGCCCTG GCCGCCTGGG CGGTGGCAGC CGCCTGCGCC
TGGCTGACCG CCACCAGCAG CAGTCGGCCG GTGGCGCCCG CCGCGGCCCC GTCCCCCTCC
GGGCCGTCGG CCTCGCCACG CCACGTGCTG ACCGGCGAGA GCCCCTGGCA CCAGGCGCTG
GGCGCGTTGT GCGGGGTGCT GGCCCTCGTC CTCGCCCTGA GCTGGCCGGT GGCGGCGTGG
GCCCTGACGG CGGGCCTGGT CGCCTACCTG GCGCTACTGC ACTGGCGCCC CCAGATCGGG
CTCCTGATTA TCCCCCTGCT GATCCCCACC CTGGACCTGA CGCTCTACAC CGGGCGGCTC
TTCCTCTCGG AACTGGACCT GTTCCTGCTC GCCACCCTGG CGGTGGTGCT CTGGCGCTGG
CCGGCGCGCC CCAGATCCAG CCTGTTGCCA AGGGCGATCC GCTGGCCGCT GATACTGCTG
ATGGCCTCAA CCGTGATCAG CCTGGTCATC GCCCTGCTGT CGAGCACCGC AATCGATCTT
GGGCAGGCCT ACCACTACGC CCATCCGTTG AACGCCCCGC GGGTGGCCAA GGGGCTGCTG
GGGGCCCTGG TGCTGCTGGG TCTGATGCGC GTCATCCCCT TGCCCCAGCG GGAGCAGGTG
GAGCGCTGGC TGCTCCCCGG CGTGGCAGCG GGCTTGCTCG CCACGATCCT GATTGTCATC
CGGGAACGGA TCACCTACCC GGGCCTGCTG GACCTGGACA GCCGCTACCG CATCTCGGGC
TTTTTCACGG ACATGCATGT GGGTGGTCCC AGCATCGAGA CCTACCTGGT ACTGGCGCTG
CCCGTCGCCC TGACCTGGTG CCTGCACCGG CGGCTGACCT GGGCGCTTGC GCCCCTGCTG
GCCATCGGCG CCACCTACGC GATCGCGGTG ACCTATTCCC GCGGCGGCTA CCTGGGTCTG
GTCGTCGCGC TGGTGCTCTT CGCCGGGCTG GCCCTGGCCC ACGCCCTGCG CCCGGGTGCA
ACAGGGCGTG GGCGGCTGAT CACAGCCGCC TTGGTACCGC TCCTGGCGAC CGGCGCGGTG
GCGCTGCCCT TCATGGACAG TTTCGGCGAG CGCCGCCTGG GGCAGGTGCA GGCCGATTTC
GAACAGCGCC TGAGCCACTG GCGCGAGGGC CTGGACCTGC CCGGCGCCGG GCCCTGGTCA
CCCCTGATCG GGCGGGGCCT GGGCAGCTTC CCGGAGGCCT ACCGCCTCGG CAACCGCGAG
GGGCGCATCC CGGCCAATTT CGACTTCCGG CGCGTCGATG GCAATGACCT GCTGCGCCTG
GGACGGGGCG ACTCCCTGTT TCTCAACCAA CGGGTGGCCC CGCCCCGGGA GGGGGACTTC
CGGCTGCGGG TGCGGGCCCG CAGTGACGTG CCGGCCGGGT TCACGCTCTT CCTTTGCGAG
AAGCCGGTAC GTCACTCCTT CACCTGCCGC TCCCAGGGGT TGAGCCTGGG TGGTGGTGGG
GATTGGGAAC ACCTGGAATG GCGCTTCGAC CTCCGGGACA TGGACCGCCG CCCCTGGCCC
TTTCAGCGCG GACTGGTCCT CTCCATCACC AGCCGCGGTG AGCCGGGCAT CCTCCTGGAG
TTCGACGCCT TCGAACTGCT TGATGAGGCG GGCAACGACC ACCTGCGCAA CGGGGATTTC
ACCCGGCTCG GCCGGCACTG GTACATGAGC ACGGACCACC TGTGGCCCTG GCGAATCGAG
AACCAATGGC TGGAGCTCTA CTTCGACCAG GGCGCGCTGG GGCTGCTGGC CTTCGTCTGG
CTGACCCTGG CCGGGCTGCT CCTGCTGGCC CGGCGCGCCC TCGCCGGCGA CATCACGGCG
CTGGGCCTGG CCGCCGGTCT GGCCGGGGCG CTCGCCGTGG GGTTGTTCAG CACGATCTTC
TTCTCGCCCC GCATCGCCAC GCTCTTCTAC GGGCTGTTGT TGCTGGGCGT TGCGGCCACC
GGCCGGCCTG ACAAGCGGCA CGCCGGACCC ACGCCGCAGG CACCAGTGCC CGCCGCTCAA
TGCAGCGGCC CGGCCCCCGG GCCCACACCA GCCGCCCGGC CCGGGTTGCC CTCGGCGCCG
CCATCGTAG
 
Protein sequence
MTHREPWQRG TCLTWAWLAL GAAVYLTLLP FEFRDDLGLR QAWEVYRNMD LTGPGASGRQ 
QFMANALMFL PLGFFWSAWL AYGRRDRGTT LATFLAVSAL GLAVTATVEF LQIWLPFRHP
AAADIAGNFT GAVLGSLAWF ALRDPLAHWW QTLSGGGPRA LHLALIGYLV VYVVIGLLPF
DMVLSADELM RRLRSSAWGL WTTPDACTTG LRCYAWLSLE VAAAVPVGLL LAVWLQRRRF
PTLLAGLPLA VVLAIGLEAL NLLTLSGITE GRSVLLRSAG IAAGLVLLHR LPAHSHGVLR
FLQQHGRLIL GLTLPVYLLL LLGLNHGFGP YHTDLERAWR QWGELRLLPF YYHYHVPEIV
ALRSTALHLA MYAPVGLMAW LWHVGRPGGQ ARLYGLAAAS GALAALLMEA GKLFVADARP
DPASLVLGAL AAWAVAAACA WLTATSSSRP VAPAAAPSPS GPSASPRHVL TGESPWHQAL
GALCGVLALV LALSWPVAAW ALTAGLVAYL ALLHWRPQIG LLIIPLLIPT LDLTLYTGRL
FLSELDLFLL ATLAVVLWRW PARPRSSLLP RAIRWPLILL MASTVISLVI ALLSSTAIDL
GQAYHYAHPL NAPRVAKGLL GALVLLGLMR VIPLPQREQV ERWLLPGVAA GLLATILIVI
RERITYPGLL DLDSRYRISG FFTDMHVGGP SIETYLVLAL PVALTWCLHR RLTWALAPLL
AIGATYAIAV TYSRGGYLGL VVALVLFAGL ALAHALRPGA TGRGRLITAA LVPLLATGAV
ALPFMDSFGE RRLGQVQADF EQRLSHWREG LDLPGAGPWS PLIGRGLGSF PEAYRLGNRE
GRIPANFDFR RVDGNDLLRL GRGDSLFLNQ RVAPPREGDF RLRVRARSDV PAGFTLFLCE
KPVRHSFTCR SQGLSLGGGG DWEHLEWRFD LRDMDRRPWP FQRGLVLSIT SRGEPGILLE
FDAFELLDEA GNDHLRNGDF TRLGRHWYMS TDHLWPWRIE NQWLELYFDQ GALGLLAFVW
LTLAGLLLLA RRALAGDITA LGLAAGLAGA LAVGLFSTIF FSPRIATLFY GLLLLGVAAT
GRPDKRHAGP TPQAPVPAAQ CSGPAPGPTP AARPGLPSAP PS