Gene Mlg_2030 details

Gene Information

Locus tag: Mlg_2030
Symbol:
ID: 4268146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2301812
End bp: 2303818
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 64%
IMG OID: 638126786
Product: TonB-dependent receptor, plug
Protein accession: YP_742862
Protein GI: 114321179
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4771] Outer membrane receptor for ferrienterochelin and colicins
TIGRFAM ID:
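
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2301812
end_bp = 2303818

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 2007
print(protein_length_aa)   # 668
```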


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGGAC AACTCCTCCG GGCCTACAAA TGGGCAGAAG GCTGGGCTAA CCCATCGGAT 
CATTGGCCAG CGATCACCGG ACTGCTTGGT GCGGTATTGC TTGCCTCAAC CCCTGCCAGC
GCATCTACAG CCGAATTGGA CTCCGTCGTG GTGACCGCAA CCCGCTCACC ACACCCCTTA
TCGGAGGTCC CGGTGGAAAC CCGCTTGCTG GATCGGGAGG CCATCGACCG AAGCCAGGCG
CGCAATCTGC CCCAACTCCT GGGCACCTTG CCGGGGGTGT CCGCGAGCAA CCTGGACGAC
ACCTTGGGGG CCGACAACTT GCGGCTGACG CTACGAGGGC TGCAACTCAA CGAGGGTTAC
GGCCTGATAC TGGTAGACGG GCGTCGGATT CACGGTGGAC TTGGCGCCCA TGGCGATTAC
GGGGTCAGTC TCAACCAAAT CCCCCTGTCT ATGATCGACC GGGTTGAGGT GGTACGCGGA
GCAGGCTCAG CCCTCTACGG TGCCGATGCG ATGGCCGGGG TCATCAACAT CATCACCCGA
CGCCCGGAGC GTGAGCCCAC AGGCCGCGCC GGCCTGGGTG CAGGCTTCTA CGAAACGCTG
GATCGCGACG ATCGGGATGC GACCGGATCC CACCGGCGCG ATGTGGATGC TCATGCCCTT
TACAGCGGCC CGTTTGGCGA GGGGTCGACC TTTCTGGTCG GGGGGCACCA TCAACAGGAC
GAGGGCACCG ACCAAACCGC CGCCACGACT CGCAAAGATA CCGCCATGGG CCATTGGCAG
ACGGACCTGA ACGACCACTG GTCCGCAACC CTACACGGCT ACTTCGCCCG TGCCCGGCGC
GATCATCACG GCCCGGAGGC CCGTCATGAT CGCGCCTACG ACGATGCCAG CCTGACCGCC
GGATTGGACT ATGAGCAGGG GCGACACAGT TTATCCATGG GTGGGTACCA CTTCGACCAG
GACTTCGAGA CCGGCTACCC GGGTTTCGCG CACGGCTTTC GCGACGGGCG CGTGGGCTAT
CGTGAGGCCG ATGTCCGCTA TACCTACTTC GGCGAACGAC ACTGGCTGAC CATGGGGGCC
CAGCATCAGC GCCAAACCCT CGACTACCGC TTCCGGAACT ACGCCGACGG CGCACTGGAA
GACACCGTGC ACGTGGACGA GCGTATCGAC GTCAACAGCG TCTATGTACA GGACGAGATC
TGGCTACTCG GGCAGCGCCT GATTCTGGTC CCCGGAGCCC GCTACGAGGA TCACGACACC
TTCGGCAGCG AACTCAACCC CAAGCTGGCC GCGCGCTTGC AAACCGGGGA CACAACCTGG
CGGGCCTCTA TCGGTCGCGC ATTCAAGTCA CCCACACTGC GCCAGCTCTA TTATCAGGGG
CTATACCGGC ACGGCGACTA CTACCTGGCC TCCAACCCCG ATCTGTCACC GGAGCGGGCG
ATCAGCGCCA ATCTGAGCGT GGAGCGCACG TGGCCGGGCA GCCGGGTCTG GACCGCCCTG
GGGGTCTACC GTACCGAGTT GAAAGACCGG GTCACCCGGG CAGATACCGG CGAGACAACA
AGCCAGGGCG ACCCCATTCA GTCCTACATC AATATCGACC GCTCTCGCAT CGAGGGCATC
GAAGCGGAAA TGCGCGTCGG CGACAGGACC GGCTGGAGCA TGGACGCCGC ACTGGGCCTG
ACCCACGCCC GTGATCGGCG CAGCGGCGAC TGGCTGCCTT ATGTGCCGCG ACACACCGCA
AGCCTTACCC CGCGTTATGT CACCGCTTCG GGGCAGACGG GTATACAGGG GCGGATTAAC
GCCTACGGCA GACAATACCG CGATGCGGCC AATACCCGAC GTATCAGCGC CCACCAAGTG
GTCGATCTGG GGCTTTGGCA TGACCTGACG CCGGCCAGCA CCCTGCGGCT GGACATCAAC
AACGTGTTCA ATTCCGATCG CGGCGAGTCG GCCTTCGCCT TCCGCCAGGG CCGCCGCCTA
GGTGCCCGTA TTGACGTAGA GTTCTGA
 
Protein sequence
MSGQLLRAYK WAEGWANPSD HWPAITGLLG AVLLASTPAS ASTAELDSVV VTATRSPHPL 
SEVPVETRLL DREAIDRSQA RNLPQLLGTL PGVSASNLDD TLGADNLRLT LRGLQLNEGY
GLILVDGRRI HGGLGAHGDY GVSLNQIPLS MIDRVEVVRG AGSALYGADA MAGVINIITR
RPEREPTGRA GLGAGFYETL DRDDRDATGS HRRDVDAHAL YSGPFGEGST FLVGGHHQQD
EGTDQTAATT RKDTAMGHWQ TDLNDHWSAT LHGYFARARR DHHGPEARHD RAYDDASLTA
GLDYEQGRHS LSMGGYHFDQ DFETGYPGFA HGFRDGRVGY READVRYTYF GERHWLTMGA
QHQRQTLDYR FRNYADGALE DTVHVDERID VNSVYVQDEI WLLGQRLILV PGARYEDHDT
FGSELNPKLA ARLQTGDTTW RASIGRAFKS PTLRQLYYQG LYRHGDYYLA SNPDLSPERA
ISANLSVERT WPGSRVWTAL GVYRTELKDR VTRADTGETT SQGDPIQSYI NIDRSRIEGI
EAEMRVGDRT GWSMDAALGL THARDRRSGD WLPYVPRHTA SLTPRYVTAS GQTGIQGRIN
AYGRQYRDAA NTRRISAHQV VDLGLWHDLT PASTLRLDIN NVFNSDRGES AFAFRQGRRL
GARIDVEF
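
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the gene sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs only in which codons may act as start codons, and that is not modelled here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence.
print(translate("ATGTCGGGAC AACTCCTCCG GGCCTACAAA"))  # MSGQLLRAYK
```

The ten residues produced here match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the reported protein sequence and GC content.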