Gene Mlg_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1075 
Symbol 
ID4268997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1254735 
End bp1256063 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content56% 
IMG OID638125827 
Productperiplasmic copper-binding 
Protein accessionYP_741917 
Protein GI114320234 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3420] Nitrous oxidase accessory protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATGA CCTTGAAATC CATCGTTACG CTGATGCTGT TGCTCACCCT CTCCCTGATG 
AAGGGGGCCT GGGCCGAGCC AACGTATGTG CAGCCCGGGG GTATGGGTCT GCAGGAGGCC
ATTGATCAAG CCGAACCTGG CGATACGCTC TCGCTGCGAC CCGGGGTCTA TAGAGGCAAC
TTCCGGATCG ATAAACCCTT GACCCTGAAG GGGGATAACG GGGCCATCCT GGATGGCCAA
GGGGTCGGTG TCACCCTGTC CATTATTGGT GCACCAGATA CCCGTGTGGA GGGCCTGATC
ATCCGCAACA GTGGCATCGA TATGACTGAA ATGGATGCTG CGATCTTTCT CGACAAGGGC
TCCCACCGCA CAGTTATTAC AGGTAACCGG ATTCATTCCC GTGCCTTTGG GATATGGGCC
GGGGAGAGCG ATGAGGTGCT GATTATCGCC AACCGGATCA GCGGTGATAC CCGTATGCGT
TCGGCAGAGC GCGGCGATGG CATCCGCATG TTCCGGCTGA CAGACTCGGT CATCATCGCC
AACGAGATCT GGGAAGCCCG CGACGGCATC TATATCGATG TCAGTCATCA CAACCGTCTG
ATTGGCAACG TCCTGCATAA TCAGCGGTAT GGTATTCATT ACATGTTTTC CCACACCAAC
GATGTGTTGG TGAACCGAAC CTATGATAAC CGCATGGGCT ACGCCCTGAT GATGTCCCGT
CACCTCAATG TACAGGGGAA TACCTCTATC AGGGATCAGA ATTACGGGAT TCTTCTTAAT
GCGGTCACTT ATTCCTATCT CGCAAGGAAC CGTTCTCTTG ACGTGATGCG GGGCCACCCG
CCCGGTACGC CGGATGGGCA TGGCGTTCTT GGCGCCGAGG GCAAAGCCGT TTTTATTTAC
AACTCGCAAC ACAACGAATT TGAAGACAAC CTGTTCGCCC GTGCGGAGAT CGGCGTCCAT
CTGACAGCCG GCTCCAACAA CAACCACTTT CACGGCAATT CATTCGTCGG GAACCAGCAC
CAGGTGATGT ACGTCGCGAA TGTGGAGCAG GAGTGGTCTC ACGAGGGGCG GGGTAACTAC
TGGAGCGATT ACATGGGTTG GGATCTCAGG GGCGATGGCA TCGGTGATGT CCCGTATGAG
CCCAACGATG CCATGGACGG CATCCTCTGG AAATACCCTG CCGCCAAGAT CCTTTTGAAC
AGTCCTGCGG TGCAGGTCCT GCGCTGGGTT CAGCGGCAAT TCCCGGTGCT TCGCCCCAGT
GGCGTGAAGG ACAGCTATCC GCTGATCAGG CCTGCGCATG ATTTGAAGCT CCTGGAGGAA
CTCGGGTGA
 
Protein sequence
MRMTLKSIVT LMLLLTLSLM KGAWAEPTYV QPGGMGLQEA IDQAEPGDTL SLRPGVYRGN 
FRIDKPLTLK GDNGAILDGQ GVGVTLSIIG APDTRVEGLI IRNSGIDMTE MDAAIFLDKG
SHRTVITGNR IHSRAFGIWA GESDEVLIIA NRISGDTRMR SAERGDGIRM FRLTDSVIIA
NEIWEARDGI YIDVSHHNRL IGNVLHNQRY GIHYMFSHTN DVLVNRTYDN RMGYALMMSR
HLNVQGNTSI RDQNYGILLN AVTYSYLARN RSLDVMRGHP PGTPDGHGVL GAEGKAVFIY
NSQHNEFEDN LFARAEIGVH LTAGSNNNHF HGNSFVGNQH QVMYVANVEQ EWSHEGRGNY
WSDYMGWDLR GDGIGDVPYE PNDAMDGILW KYPAAKILLN SPAVQVLRWV QRQFPVLRPS
GVKDSYPLIR PAHDLKLLEE LG