Gene Mlg_2397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2397 
Symbol 
ID4269394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2722741 
End bp2723799 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content65% 
IMG OID638127155 
ProductWD40 domain-containing protein 
Protein accessionYP_743227 
Protein GI114321544 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00000884296 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGTAAGC GCAAACAAGG GTGGTTGGGC CTCCTGGTCG GGCTCATGGT CCTGCCGGGT 
ATGGCCCTGG CGGCGGACGA TAAGCCTGAT GACGCGGCCG TGCTCTTCGT CTCCAACCAG
GAAGGGGCAC AGAGCATCTA CTCGGCCAGT CTTAATGATG GCGCGATTGT CCGGTTGACC
GATCCGGCCC ACTCCGACAT GGACCCGCGC TGGTCGCCTG ACCGCCAGCG GATTGCCTTT
GTCTCCCGGC GGGACGGCAA CGGCGATATC TACCTGATGG ACGCAGACGG CAGCAACCAG
CAACGTCTCA CCCACAGCGA GCGCATGGAC TTCATGCCCC AGTGGCACCC CTCCGGCGAT
TATCTCGCCT TCACCTCCAG CCGGGTCAGT CCGCGCGGGG TCTTCCTACT GGACCTGGCC
ACCGGGGAGG CGCGGTTGCT CAGCGAGGCG GTGCGTTCTC CGGAGGCCCT GCGTTGGTCG
CCGGATGGGG GGCAGTTGGC TGTCATCGCG CGGCCCGGGG GCGAGGGTGG CAATGCCATC
ATGGTGATCG ATCTTGAGGA CGACGGGCAC AGTATCCTGG TGCCCAATGA CCGGCATGCC
GGGAATGTGC AGTCCCTCGC CTGGCACCCC AGCGGCGATT ACCTGGCCTA TACCGCCTCC
ACCGATAACC GCCGCGAGGT GCAACTTTAC GTGTTGCACG TTCAGGAGGG CCACTCGGAG
CAGGTCGCGT CCGCCCCGGG CAACGTACGG GGTTTCCCGG TTTGGTCCAC CGACGGGGAC
TGGCTCGTCT ATGCCTCCAC GGCGACCCCC GCTCCGGAGG AGACCAAGAC CAATATCTAC
GCCTCCCGCT TCCCCGATGA CGGCCGACCG GTAACGGTCG CCACCCTTGA CGGACAGCTG
GCTCAGCCGG TCTGGCTGCC AAATGATGGG GGGGAGGTTG TCTTCGTGTC GCAAAAGGGC
GGTGCAGCGG AATTGTTTCG CGGTCGGGTC GATGGTGCGG AGCCGTCGCT GGTGTTTGCG
CAGCCCGCAT ACATGCATTC GCCCCGCACA GGGCAGTGA
 
Protein sequence
MSKRKQGWLG LLVGLMVLPG MALAADDKPD DAAVLFVSNQ EGAQSIYSAS LNDGAIVRLT 
DPAHSDMDPR WSPDRQRIAF VSRRDGNGDI YLMDADGSNQ QRLTHSERMD FMPQWHPSGD
YLAFTSSRVS PRGVFLLDLA TGEARLLSEA VRSPEALRWS PDGGQLAVIA RPGGEGGNAI
MVIDLEDDGH SILVPNDRHA GNVQSLAWHP SGDYLAYTAS TDNRREVQLY VLHVQEGHSE
QVASAPGNVR GFPVWSTDGD WLVYASTATP APEETKTNIY ASRFPDDGRP VTVATLDGQL
AQPVWLPNDG GEVVFVSQKG GAAELFRGRV DGAEPSLVFA QPAYMHSPRT GQ