Gene Mlg_1943 details

Gene Information

Locus tag: Mlg_1943
Symbol:
ID: 4268111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2209349
End bp: 2210632
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 64%
IMG OID: 638126697
Product: TRAP dicarboxylate transporter, DctM subunit
Protein accession: YP_742775
Protein GI: 114321092
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1593] TRAP-type C4-dicarboxylate transport system, large permease component
TIGRFAM ID: [TIGR00786] TRAP transporter, DctM subunit
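
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2209349
end_bp = 2210632

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1284
print(protein_length)   # 427
```

Both results match the Gene Length (1284 bp) and Protein Length (427 aa) fields.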


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.0780102
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTGGA TCATGGTCGG CATCATGGTG GGGCTGCTGC TCCTGGGCTT CCCGCTGATG 
GTGCCGCTGC TCTCCGCAGC CCTCTACGTC ATGCTGTTCG AGCTGGATTT CATCAGCACG
AACCGCATTG TCGCGCAAAT GGTCTCCGGC ATCTCCTCGC CGGTGCTGGC GGCGGTGCCG
TTGTTCATCC TCGCCGCCGA CATCATGACC AAGGGCCGCA CCGCCAACCG TCTGTTGGAT
CTGGTCATGA GCTTCTTCGG CCACCTGCGC GGGGGGCTGC CGGTCACCGC GGCCATCAGC
TGCACCCTTT TCGGTGCGGT CTCCGGCTCC ACCCAGGCGA CGGTGGTGGC CATCGGCGGG
CCGCTGCGCC CAAAGCTGAT CAAGGCCGGC TATAAGGACA GCTTCACCAC CGCACTGATC
ATCAACGCCA GTGACATCGC CCTGCTCATC CCGCCGAGCA TCGGCATGAT CGTCTACGGC
GTGGTCTCCC GCACCTCGGT GCGCGAACTG TTCATCGCCG GCATCCTGCC CGGGCTGCTG
ATCCTGCTCT TCTTCTGTGT CTACACCTAT ATCTACTCCC GGCTAAAGCA GATCCCGGTG
CAGGACAGGT CAACCTGGTC GATTCGGCTG CAGGCCCTGC GCGGGGCCTT GCTACCCATG
GGCTTCCCGA TCATTGTCGT GGGTGGCATC TACGCCGGCT TCTTCTCCCC CACCGAGGCA
GCGGCCGTCT CCGTGGCCTA CGCCTTCCTG CTGGAGGTGG TGATCTTCCG GTCGCTGCAC
ATCAAGGAGA TCTGGCCCAT CGCCCTGTCC ACCGGGTTGA TCACCGCGGT GGTCTTCGTG
CTGGTGGCCT CCGGCCAGGT CTTCTCGTAC GTGGTCTCGG CAGCGCGGAT CCCCCGGGAG
TTGATCGGGC CGCTGATCGA GACCCTGGCG GGCAACCCCG AGATGGCGTT AATCGTCATC
GCCCTCGCCT ACTTCATCGG CTGTATGTTC GTGGACCCCA TCGTGGTCAT CCTGGTGCTG
ACACCGATCT TCACCCCGCT GGTGGACGCC ACCGGGCTCG ACCCGGTGCA CGTGGGCGTC
ATCGTCACCC TGCAGGCGGC CATCGGTTCG GCCACGCCAC CCTTTGGCTG CGACATCTTC
ACCGCCATCG CGATCTTCAG GCGACCTTAC TGGGACACCA TCAAGGGCAC GCCGCCGTTC
ATCTTTATCC TGTTGCTGTC CACGGCGGTG CTCATCGCCT TCCCGCAGAT CTCGCTGTTC
CTGCCGCAAC TGGCCTTCGG CTAG
 
Protein sequence
MIWIMVGIMV GLLLLGFPLM VPLLSAALYV MLFELDFIST NRIVAQMVSG ISSPVLAAVP 
LFILAADIMT KGRTANRLLD LVMSFFGHLR GGLPVTAAIS CTLFGAVSGS TQATVVAIGG
PLRPKLIKAG YKDSFTTALI INASDIALLI PPSIGMIVYG VVSRTSVREL FIAGILPGLL
ILLFFCVYTY IYSRLKQIPV QDRSTWSIRL QALRGALLPM GFPIIVVGGI YAGFFSPTEA
AAVSVAYAFL LEVVIFRSLH IKEIWPIALS TGLITAVVFV LVASGQVFSY VVSAARIPRE
LIGPLIETLA GNPEMALIVI ALAYFIGCMF VDPIVVILVL TPIFTPLVDA TGLDPVHVGV
IVTLQAAIGS ATPPFGCDIF TAIAIFRRPY WDTIKGTPPF IFILLLSTAV LIAFPQISLF
LPQLAFG
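
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which to my knowledge are the same as the standard code (table 11 differs mainly in which start codons are permitted; this gene starts with ATG, so no special handling is applied here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Compact codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces/newlines used for 10-base grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence listed above.
first_row = "ATGATCTGGA TCATGGTCGG CATCATGGTG GGGCTGCTGC TCCTGGGCTT CCCGCTGATG"
last_row = "CTGCCGCAAC TGGCCTTCGG CTAG"

print(translate(first_row))  # MIWIMVGIMVGLLLLGFPLM
print(translate(last_row))   # LPQLAFG
```

The two outputs correspond to the first 20 and last 7 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.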