Gene Mlg_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2120 
Symbol 
ID4269370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2405703 
End bp2407646 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content70% 
IMG OID638126876 
Productvon Willebrand factor, type A 
Protein accessionYP_742952 
Protein GI114321269 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4548] Nitric oxide reductase activation protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.876346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCAGT TCCTGGAGGT GGAAGAGTTC GTCGGTCGCC ACTGGCACCG TTGGGCATCG 
CAGGCACGGA GCTACCCGCG CCACCCGGAG GCCGCGGTCC AGCTCGCCAC CCTGCGCGGC
CTCCTGGGCG TGTTCTTCCG CGCCAGCGGG GGGCCGGCGG GGGTGCCGGT GGCCTCCATC
GTGGCCCGCA ACTCCCGTCA CCGCCTGACC TGGCGGCAGC GCCTCGGGTT CGACGAGGAG
CCGGTGGATC GGGCCCGCCG CGATGAGGAG AACCTGCTGC TGCCACCGGT GCTGGACTAC
TTCCCCACCG CGGCTCAGAA CCGCGACCAC TATCTTTGGC TGGCGGCCTT CCTGGCCCTG
GCGCGGCCAC CGCGCACGGA CCGCCTGCAC GACCCATTGC AGCAGGATAT CGTCCGGCTG
CGGGAGGTCC ACCGCGTGAT CCAAACCATC CGCGACCGGC TACCGGGGCT CCACCAGCGT
TACCGCGCCC TGGGCAGCGC CATGCTGGCA CTGCGCCCCC GGCGCCGCCT GCCGCCCCAG
GAGTCCGCCG TGGAGCTGGC CGTGCAGTAC CTGCTCGGGG CCGCCCTGCC GGGTCGGAGC
GCCGCCACGG CCATCGTCCG CGCCGTCACC GATCCACAAG TGGCCCTGGA TGCGTTCCAG
GCCGACCGGG ATTACCGACC ACCCCTGCCC ATGCCGCTCT GGGGCGAGGT CGTCCCGCTG
GGCACGGGGA CCGGCGCCAA GCCGGGCGAG GGCCACGACG AGGGCGGCGC CACGGGCAGC
CCCAAGCAGG CCAGCGAGGG CAAGCGCCAG GCCGAGCGCC GTCACCAGGA CCAATGCGAG
CGCGACGACC CGCTGGTCCT CAACCGCTTC GAGAAGATGC TCTCCTGGAC GGAGATGGTG
AACGTCAACC GGCTGGTTGA AGACGAGGAG GACGAAGAGG CCAAACGGGC TGCTGAACAG
ATCGAGGAGA TCACCCTCAG CCCCCATAAG CAGCGCGCGG CCACCCGGCT GAAGGTGGAC
CTGGACCTGC CGCCGGACGC CGTCACCGGC GATCGCCTGC GCGCAACCCA CACCTACCCG
GAGTGGAACT TCCGCAAGCA GGCCTACCTG CCGGACCACT GCGCCGTGCA CACGGACCTG
CAGCCCGAGG AGGGCGAGGC CTGGCGCCCC GATGCCGGCA CCCGCCGGCG CATCCGCCGG
GTGCAACGCC AGTTCGAGGC CCTGCGCCCG CGCCGGGAAC TGCTGCGCGC CCAGATCGAT
GGCGCGGAAC TGGACATGGA CGCCACCATC CGCGCCCATT GCGACCTCCG GGCCACCGGC
GAGGGCTCCG ACAACATCTA CCAGGCGGCC CGCTGCCAGG CCCGCGACCT GGCGGTGGCG
ATCCTGGTGG ACTGCTCGCT CTCCACCGAC GCCTGGCTGG AGGATCAGCG GATACTGGAT
GTGGAGAAGG AGGCCCTGCT GGTGCTGGCC CATGGCCTCA AGGGCTGCGG AGACGATTAT
GCCATCTACA CCTTCACCTC CCACCGGCGG CAGAAGGTCT GGGTAAATAC CGTCAAGGCC
TTCGACGAAC CCCTCCAGGC GCGGGTGGAG CGCCGGATCG GGGCACTCAA GCCCGGCCAT
TACACCCGCA TGGGACCGGC GCTACGCCAC GTCTCCGGCG AATTGGCCAA ACGGCCCAAT
AGGCACAAAC TGCTACTGGT GCTCACCGAC GGCAAGCCCA ACGATACCGA CTACTATGAG
GGCCGCTACG CCATCGAGGA CACGCGCAAG GCGGTACGGG AGGCCCGGCG CCAGGCCCAG
ACGGTGTTCG GTGTCACCGT GGACAGCGAG GCCCAACAGT ACTTTCCTTA CCTGTTCGGA
CGGGCCGGCT ACAGCATCGT CCAGCGGCCC GCCCACCTGG CCCAGAGCTT GCCGGCCATC
TACCGTCAGA TCATCAGCGA ATAG
 
Protein sequence
MLQFLEVEEF VGRHWHRWAS QARSYPRHPE AAVQLATLRG LLGVFFRASG GPAGVPVASI 
VARNSRHRLT WRQRLGFDEE PVDRARRDEE NLLLPPVLDY FPTAAQNRDH YLWLAAFLAL
ARPPRTDRLH DPLQQDIVRL REVHRVIQTI RDRLPGLHQR YRALGSAMLA LRPRRRLPPQ
ESAVELAVQY LLGAALPGRS AATAIVRAVT DPQVALDAFQ ADRDYRPPLP MPLWGEVVPL
GTGTGAKPGE GHDEGGATGS PKQASEGKRQ AERRHQDQCE RDDPLVLNRF EKMLSWTEMV
NVNRLVEDEE DEEAKRAAEQ IEEITLSPHK QRAATRLKVD LDLPPDAVTG DRLRATHTYP
EWNFRKQAYL PDHCAVHTDL QPEEGEAWRP DAGTRRRIRR VQRQFEALRP RRELLRAQID
GAELDMDATI RAHCDLRATG EGSDNIYQAA RCQARDLAVA ILVDCSLSTD AWLEDQRILD
VEKEALLVLA HGLKGCGDDY AIYTFTSHRR QKVWVNTVKA FDEPLQARVE RRIGALKPGH
YTRMGPALRH VSGELAKRPN RHKLLLVLTD GKPNDTDYYE GRYAIEDTRK AVREARRQAQ
TVFGVTVDSE AQQYFPYLFG RAGYSIVQRP AHLAQSLPAI YRQIISE