Gene Mlg_2064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2064 
Symbol 
ID4270450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2340241 
End bp2341614 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content71% 
IMG OID638126820 
ProductTRAP transporter solute receptor TAXI family protein 
Protein accessionYP_742896 
Protein GI114321213 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.174076 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCAG CGCCCAGCCC CCGTCGCCCG ACCCTGAGGG CTCGCCTGCC GGGCGGCCCC 
CTGGCGATCT GGGGCGCCAT CCTGTTCCTG GGGGTGATCG CCCTCGCCGT CACCTGGCAA
TTCGTGGAGC CGGCCCCGCC GCGGCAGGTG ACCCTGGCCA CAGGGGCCCC CGGCGGTGCC
TACGAACAGA TCGGCGCCGG CTACGCCGAG TGGTTCGCCG ACAAGGGGAT CACCCTGGAG
ACGGTCGCCA CCCAGGGGGC CACGGAGAAC TGGCAGCGGC TGCTGGCCGG CGAGGTGGAC
GCCGCCATCG TGCAGGGCGG CACCGCCCCG GCCGGGGCCG GCGACCAGCT CGAGGGGCTG
GTGAGTGTCG CCTACGAGCC GCTGTTCGTC TTCTACCGGG AGGACGCGCT GAATGCCGCC
CACCTGCTGG GCGGCGACCC ACCGGTGCCA CAGCGACTGG AGACCCTTTC CGGGCTGCGC
ATCGCCATTG GCTCCGAGGG CAGCGGCACC CGGACCCTGG TCCGGACCCT GCTCGATGAG
CTGGGTCTGG CCACGGACGA TGCAACCGAC ACCGCGTTGG TGGCTATCGG CGGCGAGGCC
GCTGCGCGGG GATTGCTGGA GGGCAAACTG GGTGCGGCCG CCTTCGTCAT GTCACCCACC
GCCCCGCTGG TACAGCGCCT GCTGGCCGCC GAAGGCATCG GGGTGCTCAA TCAGGCCCAG
GCCCCCACCT TCACCCAGCG GCTGCCCTAC CTGGCCACCG TCACCCTGCA CGAGGGGGTG
GTGGACCCGC GCCGCAACCT GCCCCCGCAC CGGGTTCAGA TGCTGGCGCC GGCCACCTAC
CTGGTCACCC GCAAGGACAC CCACCGCGCC ATCGCACAGC TGCTGGTGGA GGCGGAGCAG
CGCAGCCGCC GCATACACCT GGTGGGCAAC GGGGATCAAT TCCCGTCGCT GGCCCACATG
GACATCCCGG TCTCCGACCA GGCCCGCTAC TTCTTCCAGC GCGGCCCCGG GTTTCTGCAC
CGCCACCTGC CCTTCTGGGC CGCCTCGCTG GTGGATCGCC TGGCCATTCT CATCATCCCG
CTGCTGACCA TCATCATCCC CTTGGTGCGC ATCGCCCCGG CGGCGGTGAC CTGGAGCATG
CGCCGGCGGA TCTTCCGTTG GTACCGCCAG TTGCGGGTGA TCGACGAGGA GCTGGGCCGG
CCACAGCTGC CCGTCGCCCG GCTGGAGAGC AACCTGGCCC AGTTGAAGCA ACTGGACCAC
GACGTGTCCG GGACGGAGGT GCCGCTGTCC TACATGGAAG AGTTTTATAA CCTGCGGCTG
CACATCGCCT ACATGCGCCA GCGGGTCCGC GAGCGCCTCG GCAACAGCGT CTGA
 
Protein sequence
MAAAPSPRRP TLRARLPGGP LAIWGAILFL GVIALAVTWQ FVEPAPPRQV TLATGAPGGA 
YEQIGAGYAE WFADKGITLE TVATQGATEN WQRLLAGEVD AAIVQGGTAP AGAGDQLEGL
VSVAYEPLFV FYREDALNAA HLLGGDPPVP QRLETLSGLR IAIGSEGSGT RTLVRTLLDE
LGLATDDATD TALVAIGGEA AARGLLEGKL GAAAFVMSPT APLVQRLLAA EGIGVLNQAQ
APTFTQRLPY LATVTLHEGV VDPRRNLPPH RVQMLAPATY LVTRKDTHRA IAQLLVEAEQ
RSRRIHLVGN GDQFPSLAHM DIPVSDQARY FFQRGPGFLH RHLPFWAASL VDRLAILIIP
LLTIIIPLVR IAPAAVTWSM RRRIFRWYRQ LRVIDEELGR PQLPVARLES NLAQLKQLDH
DVSGTEVPLS YMEEFYNLRL HIAYMRQRVR ERLGNSV