Gene Mlg_1004 details


Gene Information

Locus tag: Mlg_1004
Symbol:
ID: 4268372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1142208
End bp: 1143863
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 64%
IMG OID: 638125755
Product: nitrate/nitrite antiporter
Protein accession: YP_741847
Protein GI: 114320164
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
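
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the stop codon is part of the gene."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1142208, 1143863)   # 1656
length_aa = protein_length(length_bp)       # 551
```

Under these assumptions, 1143863 - 1142208 + 1 = 1656 bp, and 1656 / 3 - 1 = 551 aa, matching the listed values.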


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCACCC ATCCGGGTAG AGGCAACGAC AGGAGCACTA TCGTGGCAGC CAACAAAGAC 
ATCGAACATT GGGATCCGGA AGACGAGAAA CAGTGGGAGA GTTTCGGCAA ACGGGTCGCC
AATCGGAACC TGTGGATCTC CATCCCCGCC CTGCTGTGCG GCTTCGTCAT CTGGCTGATG
TGGGGCATGA TCACGGTGCA GATGCGCAAC CTCGGGTTTC CCTTTGAGGA CACCGAGTTG
TTCACCCTCG CCGCCATCGC CGGGCTCTCC GGCGCCACCC TCCGCATCCC GGCCTCCTTC
ATGATCCGCA TCGCCGGCGG GCGGAACACC ATCTTCCTCA CCACCTCCCT GCTCATCATA
CCCGCCCTGG GCACCGGGCT GGCGCTGCAG TCCCAGGAGA CGCCGCTGTG GGTCTTCCAG
GCGCTGGCGC TGCTCTCGGG CATCGGCGGT GGCAACTTCG CCTGCTCGAT GAGCAACATC
TCCACCTTCT TTCCCAAGCG CCAGCAGGGG CTGGCCCTGG GCCTGAATGC GGGGCTGGGC
AACTTCGGCG TCACCACCAT GCAGATCGTC ATCCCGCTGG CGATGACGGT GGGCATCTTC
GGCGCCCTCG CCGGTGATCC GATGGTATTG GAGCGGGCCA GCGGCACATT GATCGGTCGC
ATCGAGGCCG GTACCGAGAC CTTTATCCAG AACGCCGCCT TTGTCTGGCT GCTGCTGCTG
GTCCCTCTGG CCTTCGCCAC TTGGTTCGGC ATGAACAACC TGCGGACGAT CACCCCCAAC
CCGGGGACCC CGTTCGCCGC CTTCGGCAAG ATCCTGGCGC TCTACGGTAT CGGCTTCATC
GCCGCCGCTG TCGGCCTCTA CCTCTTCCTG CCGCCACCGG TGGGCCTGGG GCTGCTGAAC
ATGTGGATCA CCCTGCCGCT GATCATTCTC ATCACCCTGG GGCTGATGCG ACTGATGCCG
GGCGAGGAGA TCAAGCCCAA CATCAAGAAG CAATTCGCCA TCTTCCGCAA CAAGCACACC
TGGTCGATGA CGGTCCTCTA CATTCTGACC TTCGGATCGT TCATCGGGTT CTCGGCGGCA
CTGCCGCTCT CCATTGAGGT CATCTTTGGC AGCCTGATGG AGACCCTGCC CGACGGCACC
ACCCAGCGGG TGGAAAACCC CAACGCCCCC AGCGCCCTCA CCTTTGCCTG GATGGGCCCG
TTCGTGGGCG CGCTGATCCG ACCGGTGGGC GGTTGGCTGT CCGACAAGGT GGGCGGCTCC
ATCGTCACCC AGGCCATCTC GGTGGTCATG GTCATCGCCT CCGCGGCGGT CGGCTGGGTG
ATGATGATGG CCTACAACTC GCCGGACCCG AACACCTGGT TTTGGCCCTT CCTGCTGCTG
TTCATTGTGC TCTTCGCCGC CAGCGGCATC GGCAACGGTT CCACCTTCCG CACCGTGGGC
GTGATCTTCG ACCAGCACCA GAAGGGGCCG GTGTTGGGCT GGACCTCGGC GGTTGCCGCC
TACGGCGCCT TTATCGCCCC GCGCGTGATG GGCGAGCAGA TCGAGGCGGG CACGCCGGAG
CTGTCCATGT ACGGGTTCGC CATCTTCTAC GCCCTCTGCC TGATCCTGAA CTGGTGGTTC
TATCTGCGGA AGAACGCCTA CATCAAGAAC CCCTGA
 
Protein sequence
MVTHPGRGND RSTIVAANKD IEHWDPEDEK QWESFGKRVA NRNLWISIPA LLCGFVIWLM 
WGMITVQMRN LGFPFEDTEL FTLAAIAGLS GATLRIPASF MIRIAGGRNT IFLTTSLLII
PALGTGLALQ SQETPLWVFQ ALALLSGIGG GNFACSMSNI STFFPKRQQG LALGLNAGLG
NFGVTTMQIV IPLAMTVGIF GALAGDPMVL ERASGTLIGR IEAGTETFIQ NAAFVWLLLL
VPLAFATWFG MNNLRTITPN PGTPFAAFGK ILALYGIGFI AAAVGLYLFL PPPVGLGLLN
MWITLPLIIL ITLGLMRLMP GEEIKPNIKK QFAIFRNKHT WSMTVLYILT FGSFIGFSAA
LPLSIEVIFG SLMETLPDGT TQRVENPNAP SALTFAWMGP FVGALIRPVG GWLSDKVGGS
IVTQAISVVM VIASAAVGWV MMMAYNSPDP NTWFWPFLLL FIVLFAASGI GNGSTFRTVG
VIFDQHQKGP VLGWTSAVAA YGAFIAPRVM GEQIEAGTPE LSMYGFAIFY ALCLILNWWF
YLRKNAYIKN P
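
The protein sequence can be reproduced from the gene sequence by codon-wise translation, and the GC content is the fraction of G and C bases. The following is a minimal sketch rather than the pipeline used to build this record. It assumes the sequence is given on the coding strand, it ignores the alternative start codons that translation table 11 permits (this gene starts with ATG, so that does not matter here), and it relies on table 11 sharing its codon-to-amino-acid assignments with the standard code. The names `translate` and `gc_content` are hypothetical helpers.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGTCACCC ATCCGGGTAG AGGCAACGAC AGGAGCACTA TCGTGGCAGC CAACAAAGAC"
prefix_protein = translate(first_line)   # "MVTHPGRGNDRSTIVAANKD"
```

The translated prefix corresponds to the first two blocks of the protein sequence (MVTHPGRGND RSTIVAANKD). Passing the full gene sequence to `gc_content` gives a value to compare against the GC content listed in the record.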