Gene Mlg_2788 details

Gene Information

Locus tag: Mlg_2788
Symbol:
ID: 4269722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 3170819
End bp: 3171847
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 64%
IMG OID: 638127550
Product: TRAP dicarboxylate transporter- DctP subunit
Protein accession: YP_743618
Protein GI: 114321935
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID:
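
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3170819, 3171847)
print(length)                     # 1029
print(protein_length_aa(length))  # 342
```

Both values agree with the Gene Length and Protein Length fields of this record.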


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000133752
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCAA AAAAAACCTA TCGCCCCCGT TGGGCATTTG CAGTTACCGT CACGATGTGT 
GCCCTGGCTG CGGCGCTCGT CTGGCCCATG AGTGCGCAAG CGCAGATGCG GCTGGACGCC
TCCCACCAGT GGCCAGGCGG TCAGGGTGAC GTCCGTGACG AGATGGTCCA GATCATCGCG
AACCGAGCGG AGGAGGCCGA TGTGGGCCTG CAGGTGCGCG TCTACCCCGG CGCTTCCCTG
TACCAGCCCC GTGAGCAGTG GCCGGCACTG TCCCGTGGCC GGCTGGCCAT CACTGCGTTG
CCGCTGGCCT ATGTCGGTGG CCGTGTGCCG GAGGTGAACC TGACCCTGAT GCCGGGCCTG
GTCCGCAATC ACGATCACGC GCGGCGGATT AACGAGTCGC CCTTCATGGA GCGGCTGGAA
GAGATCATGC TCGAGCACGG CGTGAAGGTG CTGGCACATA CCTGGCTGGC CGGGGGTTTT
GGCTCCACTA AGCAGTGCAT CCTGCATCCG GACGACGTGG ACGGCATCAA CATCCGTGCC
GCCGGCGCCG CCTTCGAGCA GATGCTGGCC GAGGCCGGGG CATCCATCGC CTCCATGCCC
AGCTCCGATA TCTATACCGG GCTGCAGACC GGGGTGCTGG ACTCCGCCAA CACCAGCTCC
GCAAGCTTTG TCTCCTTCCG CCTCTACGAG CAGCTGGAGT GCGTGACCCC GCCGGGTGAC
TACGCCCTGT GGTTCATGTA CCAGCCGATT TTGGTCTCCA CCCGCATCTG GGATCGCCTC
GACGAAGAGC AGCAGGCTGT GTTGCTGGAG GCGGGCCAGG AAGCCGAAGA GTTCGCCTAT
CATGCCGCCA TTGAGGCCGA TAAGCGCTTC GCCGAGGTCT ACGAGGAGCA CGGCCGGCAG
GTGGTTTACA TGACCGAGGA CGACTTCAAT GCCTGGCGTG AGATCGCCGA GCGCAGCTCT
TACGCCAACT TCGTGCGCGA TGTGGAGGGC GGCCAGGAAC TGCTCGATAT GGCCCTGGAA
GTAGAATAA
 
Protein sequence
MSSKKTYRPR WAFAVTVTMC ALAAALVWPM SAQAQMRLDA SHQWPGGQGD VRDEMVQIIA 
NRAEEADVGL QVRVYPGASL YQPREQWPAL SRGRLAITAL PLAYVGGRVP EVNLTLMPGL
VRNHDHARRI NESPFMERLE EIMLEHGVKV LAHTWLAGGF GSTKQCILHP DDVDGINIRA
AGAAFEQMLA EAGASIASMP SSDIYTGLQT GVLDSANTSS ASFVSFRLYE QLECVTPPGD
YALWFMYQPI LVSTRIWDRL DEEQQAVLLE AGQEAEEFAY HAAIEADKRF AEVYEEHGRQ
VVYMTEDDFN AWREIAERSS YANFVRDVEG GQELLDMALE VE
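
The relationship between the two sequences above can be checked by stripping the block spacing and translating the gene sequence codon by codon. The following is a minimal sketch using only the Python standard library. The helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares. Table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
first_codons = clean_sequence("ATGTCGTCAA AAAAAACCTA TCGCCCCCGT")
print(translate(first_codons))  # MSSKKTYRPR
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the protein sequence listed above, and `gc_percent` can be compared against the GC content field.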