Gene Mlg_2589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2589 
Symbol 
ID4270298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2932553 
End bp2933980 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content66% 
IMG OID638127348 
Productputative nitrate transporter component 
Protein accessionYP_743419 
Protein GI114321736 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.00200575 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAACC GATCACTCAA CGACCCGTTC AACCCGGAGG CGGACCTGCG CCACGGAGCG 
GGCTGCACCT GCTCCAGCTG CGGTGGCGGC GAACATGCCC AGCACGACCA CGGGGCCGCC
CCAGCGGAGC AGGACGCCAA CCAGATGCTG GCCGCGAAGC AGGGCGTCGA CAAAGAGGCC
ATGTTCGACC GGGCCGTGGA GAGCGCGGTG GTGCGTGCCC TGTTCGGTCA CCACGACGCC
AGCCGGCGTT TCTTCCTGAA GGCGGTGGGC GCCGGCACCT TTGCTGCGGC GGTGGGCTCC
ATATTCCCGC TGGATGCGGC CAAGGCCATG CTCAAGGACA ACCTGGGTGA CCCCGAGAAG
CGCGACCTGA CCGTGGGCTT TGTGCCCATC ACCTGCGCCA CGCCCATCAT CATGGCCCAC
CCCATGGGCT TCTACGAGCG CTACGGGCTG AACGTGGACC TGCGCTCCAC CGCCGGCTGG
GCGGTGGCCC GGGATATGTC CATGAACCGG GAGTACGACG CCTCCCACAT GCTCACCCCC
ATGCCCCTGG CCATGACCAT GGGCACCGGC TCGTCGAGCA TGCCCTTCAT CATGCCGGCG
GTGGAGAACA TCAACGGCCA GGCCATCACC CTGCACAACA AGCACAAGGA CAAGCGCGAC
CCCAGCCAGT GGAAGGGCTT CCGCTTCGCC GTGCCCTTCG ACTTCTCCAT GCACAACTTC
CTGCTGCGCT ACTACGTCGC CGAGCACGGC CTGGACCCGG ACCGGGATAT CCAGATTCGC
GTGCTGCCGC CGCCGGAGAT GGTGGCCAAC CTGCGCGCCG GCAACGTGGA CGGCTACCTG
GCCCCCGACC CCTTCAACCA GCGGGCGGTC TGGGAGGAGG TGGGTTTCAT CCACCTGCTC
ACCAAGGAGA TCTGGGACGG CCACCCCTGC TGCGCCTTCG CCACCAGCCG CGCCTTTGCC
GAGGAGTACC CGAACAGCTT CGGTGCCCTG TTCAAGGCCA TCGTGGACGC CACCCACTAT
GCCTCCGAGC ACGAGAACCG GGCCGAGATC TCCGAGGCCA TCGCCCCGCG CAACTACCTG
AACCAGCCGG TATCGGTGAT TCAGCAGGTG CTGACTGGCC GCTACGCCGA CGGGCTGGGC
AACGTCAAGG AGGACCCGGA CCGGATCGAC TTCGACCCCT TCCCCTGGCA CTCCATGGCG
GTGTGGATCA TGACCCAGAT GAAGCGCTGG GGTTACGTGG ACGGCGACGT GGACTACAAG
GGCATCGCCG AAGAGGTCTA CCTGGCCACC GACTGCGGCA AACTCATGCG CGAGCTGGGC
TACGAGCCGC CGGAGGTCAC CTACAAGAGC CACATGATCA TGGGCAAGGC GTTCGACCCT
GAGCAGCCGG AGGCCTACGT GGACAGCTTC GAGATACGGA GGTCGTAA
 
Protein sequence
MSNRSLNDPF NPEADLRHGA GCTCSSCGGG EHAQHDHGAA PAEQDANQML AAKQGVDKEA 
MFDRAVESAV VRALFGHHDA SRRFFLKAVG AGTFAAAVGS IFPLDAAKAM LKDNLGDPEK
RDLTVGFVPI TCATPIIMAH PMGFYERYGL NVDLRSTAGW AVARDMSMNR EYDASHMLTP
MPLAMTMGTG SSSMPFIMPA VENINGQAIT LHNKHKDKRD PSQWKGFRFA VPFDFSMHNF
LLRYYVAEHG LDPDRDIQIR VLPPPEMVAN LRAGNVDGYL APDPFNQRAV WEEVGFIHLL
TKEIWDGHPC CAFATSRAFA EEYPNSFGAL FKAIVDATHY ASEHENRAEI SEAIAPRNYL
NQPVSVIQQV LTGRYADGLG NVKEDPDRID FDPFPWHSMA VWIMTQMKRW GYVDGDVDYK
GIAEEVYLAT DCGKLMRELG YEPPEVTYKS HMIMGKAFDP EQPEAYVDSF EIRRS