Gene Mlg_2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2549 
Symbol 
ID4270937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2891927 
End bp2893552 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content69% 
IMG OID638127308 
ProductNAD+ synthetase 
Protein accessionYP_743379 
Protein GI114321696 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0171] NAD synthase
[COG0388] Predicted amidohydrolase 
TIGRFAM ID[TIGR00552] NAD+ synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.402405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0553974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGC GCGTCACCAT GGCCCAACTG ACCTGCCCGG TGGGCGACAT CGAAGGCAAT 
ACCCGCCGCA TCGTCGGGGC CATCGAGACC GCCCGCGACA GGGAGGGCGC TCAACTGGTC
GTCTTCCCCG AGCTGGCCGT CACCGGCTAC CCGCCGGACG ACCTGCTGCT GCGAGACGAC
TTCACCCGGG CCGCGGAGAA CGCACTGCAG GCCATCCAGG CCGCCAGTCG GGGTGTCACC
GCCGTGGTCG GCGTGCCGTT GCGCGATCGC CGTGGACTGC ACAACGCCGC CGTGGTGGTC
CAGGACGGCC GGGTGATCGC GCGTTACGCC AAACGGGAAC TGCCCACCTA CAGTGTGTTC
GACGACAGCC GCCACTTCGT CGCCGGCGAC AGCCCCTGTG TGGTCGACGT GGCCGGCACC
CGGGTGGGGC TCAGCATCTG CGAGGACATC TGGTGGCCGA CACCGGCCCG CGAGGCGGTG
GCCGCCGGGG CCGAGTTCGT GGTCAACCTC AATGCCTCGC CCTTCCACCG GCGCAAGCAG
GCGGAACGCG AGGCGGTCCT GCGCGAGCGC GCGCTGGACA CCCACCGCCC CCTGCTCTAC
GTGAACATGG TCGGCGGCCA CGACGAGGTG GTCTATGACG GTGGCTCGCT GGCCGTGGAT
GCCGGCGGCA CCGTCCAGGC GCGCGCCCCC CGCTTCCGAA GCGGGCTCTG CACCGTGGAG
GTGGACACGG ATCACGGCAA TGTCAACGGC GAGCAGAGCA CCCAGCCCTC GGAGGAGGGG
GCCGTGTACC AGGCGCTGGT CACCGGTCTG CGTGACTACG TGCAGCGCAA CGGCTTCCCC
GGCGTAGTCC TGGGGCTGTC CGGGGGGATC GACTCGGCCG TTGCTGCGGC AGTGGCCGTG
GATGCCCTCG GCGCCGACCG GGTCCAGGCG GTGATGATGC CCAGCCGCTA TACCGCGCCC
ATGAGCTTGG ACGATGCCCA GGCCATCGCC CGCATGCTGG GCATTCGCTA CCAGACCACC
TCCATCGAGC CCATCTTCCA GAGCTTTCTC AGCAGCCTGG CCCCCAGTTT CGAGGGGCTG
GACCCGGACG TGACCGAAGA GAACCTGCAA TCGCGCATCC GCGGCACCCT GCTCATGGCG
CTGTCCAACA AGACCGGGCG CATGGTGCTG GCCTGCGGCA ACAAGAGCGA ACTGGCCGTC
GGCTACGCCA CGCTCTACGG CGATATGTGC GGCGGCTATG CACCGCTCAA GGATGTCTAC
AAGACCGAGG TCTACCGGCT GGCCCGCTAC CGCCAGAGCC TCAAACCGGC ATTCCCCGAC
AACATCTTCT CACGCCCACC GACCGCCGAA CTGGCGGCGG GCCAGAAGGA CGAGGACAGC
CTGCCACCCT ATCCGGTGCT GGACGATATA CTGGAGAGGT ACGTGGAGCA CGACGAGAGC
GAGGCACTGA TCGTCGCCGC CGGCCATGAA CCGGCCACCG TGGCCCAGGT CACCCGGCTG
CTGCGGCGCA ACGAGTACAA GCGGCGCCAA TCCGCCCCCG GCCCCAAGGT AACCCCCCGG
GCCTTCGGGC GCGACCGCCG CTATCCCATC AGCTCAGGTT GGCCGGGCGT ACCGGTAACC
TCCTGA
 
Protein sequence
MSLRVTMAQL TCPVGDIEGN TRRIVGAIET ARDREGAQLV VFPELAVTGY PPDDLLLRDD 
FTRAAENALQ AIQAASRGVT AVVGVPLRDR RGLHNAAVVV QDGRVIARYA KRELPTYSVF
DDSRHFVAGD SPCVVDVAGT RVGLSICEDI WWPTPAREAV AAGAEFVVNL NASPFHRRKQ
AEREAVLRER ALDTHRPLLY VNMVGGHDEV VYDGGSLAVD AGGTVQARAP RFRSGLCTVE
VDTDHGNVNG EQSTQPSEEG AVYQALVTGL RDYVQRNGFP GVVLGLSGGI DSAVAAAVAV
DALGADRVQA VMMPSRYTAP MSLDDAQAIA RMLGIRYQTT SIEPIFQSFL SSLAPSFEGL
DPDVTEENLQ SRIRGTLLMA LSNKTGRMVL ACGNKSELAV GYATLYGDMC GGYAPLKDVY
KTEVYRLARY RQSLKPAFPD NIFSRPPTAE LAAGQKDEDS LPPYPVLDDI LERYVEHDES
EALIVAAGHE PATVAQVTRL LRRNEYKRRQ SAPGPKVTPR AFGRDRRYPI SSGWPGVPVT
S