Gene Mlg_2402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2402 
Symbol 
ID4269989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2730272 
End bp2731258 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content73% 
IMG OID638127160 
Productgeneral secretion pathway protein C 
Protein accessionYP_743232 
Protein GI114321549 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3031] Type II secretory pathway, component PulC 
TIGRFAM ID[TIGR01713] general secretion pathway protein C 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.237406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00100026 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGCCG AATCAGGATA CAGAGGGCTG GGTATTGCCG GGGTCGAGCG GTTGGGCGCC 
TTTGCCGGCG GCCTGCTGAC CGGGCGTGGC GGGCACTGGC TGCGCTGGCT GCTGGCGGTG
CTGCTGGTGA TTCTGCTGGC GCAGGGGGGT GCCCGCCTGA CCTGGTGGCT GCTCGGTTAC
GGCCCGGCCC AGGAGGTGCC TGCCGCGGCG TGGTCCGGGC CGGAGGAGCC TGCCGCGCCT
GGCGGCGATG AGGCCGATGC CGGCGAGACG CGCCTGGCGG CGGTGGCCCG CCTGCACCTG
TTGGGGCAGG CCGAGGACGA CGACGAGCTG GACGCCGTCC TGGCCACCGA GGAACTCCCG
GAGACCCGGC TCAACCTGGA GCTGAAAGGG GTGCTGGCCC GTGGCGGCCA GGGCCAGGGC
GCGGCGCTGA TCGCCAGCCG GGGCCGCACT GAGGTCTTCC GGGTATCGGA CGAGGTGCCG
GGTGGCGCCA CCCTGGTCCA GGTCCACACC GACCGGGTGG TGCTGCGCCG TGACGGCCGC
CACGAGCTGC TGCGCCTGCC CCGGAAAGTG GCCGAACTGC TCGCGAGCGC CGACTTCGAT
GTGCCCGGGA CCCAGGCCGG GCGTGGGTCG GATGCCGCCC GGGTGCTCCC GGCGGCCGCG
CGGGGCGATG TGGACCGCGA GGCCCTGTCC GACCTGCGCG GTGAACTCAC GCGCAACCCG
GAACGGCTGT GGGATATCGT CAACGTCCGG CCGGTGATGG AGGGCGGACG ACTGCAGGGC
TACCGGCTGC AGCCCGTCCA GCACCAGGCG CTGTTCCGTC AGGCCGGACT GCGGGATGAT
GACGTGGTGA CGGCGGTCAA CGGCGTGGGT CTGGATAACC CGGCGCGGAT GGGGGAACTC
ATGGGCAGCC TGGCGACCGC CGACCGGATA ACCCTGGATG TCCGCCGGGA CGGGCGGATG
GAAACCGTGA TAGTGGAGCT GCAATGA
 
Protein sequence
MAAESGYRGL GIAGVERLGA FAGGLLTGRG GHWLRWLLAV LLVILLAQGG ARLTWWLLGY 
GPAQEVPAAA WSGPEEPAAP GGDEADAGET RLAAVARLHL LGQAEDDDEL DAVLATEELP
ETRLNLELKG VLARGGQGQG AALIASRGRT EVFRVSDEVP GGATLVQVHT DRVVLRRDGR
HELLRLPRKV AELLASADFD VPGTQAGRGS DAARVLPAAA RGDVDREALS DLRGELTRNP
ERLWDIVNVR PVMEGGRLQG YRLQPVQHQA LFRQAGLRDD DVVTAVNGVG LDNPARMGEL
MGSLATADRI TLDVRRDGRM ETVIVELQ