Gene Mlg_0131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0131 
Symbol 
ID4269824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp148926 
End bp150548 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content68% 
IMG OID638124855 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_740976 
Protein GI114319293 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAG GGTTCGACGG CATCTACGGT CTCCACCAGG GTTTCTTGCA ATCCGTCGAC 
CGCCACTCCG GTCGCCCCGC GCTAGAGGTA CAAGGGGCGC AGTACAGCTA CAGCGACCTC
TACCACCAAG CCGCAGCGGT CGGTGCTGCC CTGCAAAGCG CCATGCCGGA GACCACGGCG
CCGTTGGTCG GGGTACTGGC CAATCGCTCT CTGCCCGTCT ACAGCGGCCT GCTCGGCACG
CTCATGGCGG GGAAGGCGGT GGTGCCGCTG AACCCTGGCT TCCCGCAGGA ACGCACCCAA
CAGATGGTTG AACAGGCCGG CCTGCAGGCC CTGGTGGCGG ATGGGCAGGG TGAGGCCCTG
CTCAGCGACT TGCTACCTGG AATGGATGTC CCCATGGTCG TGGTGCTGCC CCTTGCCGAA
TCGGCCCAGG CACTGCAGGC GCGCTTCCCG CAGCACCGTT TCCTGACGCG CGCGGAGTTG
GGTGCGCCGT CTGACTGGCG CCCGGCATCG GTGCAACCCG ACGACCTGGC CTATCTGTTC
TTCACCTCCG GCAGCACGGG AACGCCCAAA GGCGTGGGAG TGCTTCACCG CAACGCTCTG
CGCTTCGTCG CCATGTCCCT GGAGCGCTAC CGGCCGTTCG GGATCAGCGA GGCAGACCGC
TTCTCGCAGT TTTACGACAT CACCTTCGAC TCCTCGATGT TCGACCTGTA CGTCTCCTGG
GCCTTCGGCG CCTGCCTTTG CTGCCCCACG GCGAAGGAGT GGTTCAACCC CAACAAGTAC
ATCGAGGAGG GCAGGCTGAG CGTCATCGAT ATCACGCCCT CCGCCGGTCA CGGCATGAAC
CGGCGGGACG GCTGGCGCCC GGGCCGCTTC CAGGCCCTGA GGCTGTGCCG GTTCGGTGGT
GAGGCCCTGT CCGCGGAGCT GGCCACCGCG ATGGCCGCAG CAGCCCCCCA TGCGCGGGTG
GACAATGCCT ACGGCCCCAC GGAATGCACC GTGGATTCCG CCTATTACCT GTGGGACCCG
GAACGCTCGC CGGGCGAGTG TGAACACGGC ATGGTCCCCA TCGGTTACCC GGGCAACCAG
GTCCAGCTGA CCGTGGTGGA TGACGACCTG CAACCGGTTC CCGAGGGGGC CGAGGGCGAG
TTGCTAATTG GCGGGCCACA AGTCACCCCG GGCTACTGGA ACGATCCGGA GCGCACGGAA
CAGGCCTTTA TCCGGCTGCC CTCGGACGGC GCGGTTCACT ACCGGACCGG CGACCTGGTG
CGCCGCCCAC CGGCCGGCAA GCCCATCATG TTCCTGGGCC GCATGGACCA TCAGATCAAG
GTGGGCGGGG TACGCATCGA ACTGGGCGAA GTGGAGCAGG CCCTGCGCGA GGCGGCCGCC
ACCGACGAGG CCGTGGCCCT GGGCTGGCCA CGTACCTCCA GCGGTGCGGC CGGGATTGTG
GGCTTCGTGG TGGCCGGGAC GGCCGACGAG GCCGCGATAC GCGATCAGCT GCGCAGCCGG
CTGCCCAGCG TGATGGTGCC CCGGGTCATC CACGCCCTGG AGGCGCTGCC GCTCAACCCC
AACGGCAAGG TGGACCGCAA GGCCTTGATG GCCCGGCTGG AAGCGGAGGC CGGGGGTCGA
TGA
 
Protein sequence
MATGFDGIYG LHQGFLQSVD RHSGRPALEV QGAQYSYSDL YHQAAAVGAA LQSAMPETTA 
PLVGVLANRS LPVYSGLLGT LMAGKAVVPL NPGFPQERTQ QMVEQAGLQA LVADGQGEAL
LSDLLPGMDV PMVVVLPLAE SAQALQARFP QHRFLTRAEL GAPSDWRPAS VQPDDLAYLF
FTSGSTGTPK GVGVLHRNAL RFVAMSLERY RPFGISEADR FSQFYDITFD SSMFDLYVSW
AFGACLCCPT AKEWFNPNKY IEEGRLSVID ITPSAGHGMN RRDGWRPGRF QALRLCRFGG
EALSAELATA MAAAAPHARV DNAYGPTECT VDSAYYLWDP ERSPGECEHG MVPIGYPGNQ
VQLTVVDDDL QPVPEGAEGE LLIGGPQVTP GYWNDPERTE QAFIRLPSDG AVHYRTGDLV
RRPPAGKPIM FLGRMDHQIK VGGVRIELGE VEQALREAAA TDEAVALGWP RTSSGAAGIV
GFVVAGTADE AAIRDQLRSR LPSVMVPRVI HALEALPLNP NGKVDRKALM ARLEAEAGGR