Gene Mlg_2639 details

Gene Information

Locus tag: Mlg_2639
Symbol:
ID: 4270684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2989944
End bp: 2991329
Gene length: 1386 bp
Protein length: 461 aa
Translation table: 11
GC content: 71%
IMG OID: 638127398
Product: TolC family type I secretion outer membrane protein
Protein accession: YP_743469
Protein GI: 114321786
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID: [TIGR01844] type I secretion outer membrane protein, TolC family
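
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (both are assumptions, not stated in the record). The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2989944
END_BP = 2991329
GENE_LENGTH_BP = 1386
PROTEIN_LENGTH_AA = 461


def span_length(start: int, end: int) -> int:
    """Length of a region given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Both comparisons hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```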


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCTGC CCCGGCGAGT CGGCCGTCCG GCGCTGGCGT TGCTGCTGCT CGTGGCCCTG 
CTGGTGGCGC CGCCGCTCAC GGCGGCCGAC GATCTGCTGC GCGTCTATCA CGAAGCGCGC
GAGAGCGACC CCCAGCTACG CCAGGCCCGC TCGGAGCTCG AGACCGTGCG CGAGCGGGTG
CCCCAGGCCC GTGCCGGACT CCTGCCCGAA TTGAGCGCCT TCGGTTCCGC CGACCGCGAG
CGCAACGAAC CGCGCGGCAG CGGTGCCGGT GAGCTGGGCA CCGACTATTA CACCACCAGC
GCCATCGGGC TGGACCTCAC CCAACCGGTC TTCCGCTACG GCCTCTTCCT GCGCCTGCGC
GAGGCGGACC TGCAGGTGGC CCAAGCCCAG GCCCGGCTCA ACAGTGCCGA GCAGGACCTG
ATGGTTCGGG TCACTGAGCG CTATTTTGAG GTGCTGGCGG CCCTGGATGG CGTGGCCTTT
GCCCGCGCCG AGTTGCGCGC CATCGAGCGC CAGTTGGACC AGGCGGAACA GCGCTTCGAG
GTGGGGATGG CGCCGGTCAC CGATGTGGAA GAGGCCCGCG CCCGGCGGGA TCTCTCCCAC
GCCAACCTGT TGCAGGCCGA AACGGATCTG GACAGCGCCC GGGAGGGCCT GCGCGAGCTG
ACCGGACGCA GCCACCGTGA GCTCGCCCTG CTGGACGAGG CGCTCCCGCT GGAGTCCCCG
GAGCCGGAAG ACAGCGAGGC CTGGGCGCGG CAGGCGGAGC GCCAGAACTG GGACCTGCTG
GCCTTCCGTC ACGGTTCCGA GGCGGCGATG GAGAACATCG GTGTCCAGCG TGCCGAGCAC
CTGCCCACGG TCGACCTGGT CGGCAATGTG CAGCGGGTGG ATCAGGGCAG CCGTACCACC
GCGACCGGCG TGCCCCGGGG TCCGGAGGAG TTCGACCAGG CGAGCATCGG GCTGCGGCTG
AACCTGCCCC TCTACCGGGG TGGCGCCACC CGCTCCCGGG TGCGCGAGGC GCAGTCCCAG
TACACCACGG CATTAGAGGA GCTCGAGGAG ACCCGCCGTG CCGTGGTCCG CGGCGCCACC
GACGCCTACC GCGGTGTGCG CAGTGCCATC GCCCGGACGG CGGCCTTCGA GCAGGCCATC
ACCTCCACCC AGCGGGCGCT GGAGGCGGTG GAGGCCGGCT TCGAGGTGGG GACGCGGACG
GTGGTGGAGC TGTTGGATGC CCAGCAGGAC CGGTTGGGCG CCGAGCAGGA CTTCCGCCAG
GCGAATTACG ATTACCTGCT GCAGACCGTG CGTCTGAAGC GTTTCGCCGG CACCCTCTCC
GATGCCGATC TGGCGACCAT CAACACCTGG TTGGACACCA ACGCCGAGGT TGTCCCCGAT
ATCTGA
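
The gene sequence is printed in blocks of ten bases, so it has to be cleaned before any computation. As a rough sketch, the helper below strips whitespace and computes the GC fraction, which is one way the reported GC content could be recomputed from the sequence. The function names are hypothetical, and how the database rounds its percentage is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First block of the gene sequence above, used only as a small demo input.
    first_block = "ATGCCTCTGC"
    print(f"{gc_fraction(first_block):.0%}")
```

To check the whole gene, paste the full block-formatted sequence into a string and pass it to `gc_fraction`.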
 
Protein sequence
MPLPRRVGRP ALALLLLVAL LVAPPLTAAD DLLRVYHEAR ESDPQLRQAR SELETVRERV 
PQARAGLLPE LSAFGSADRE RNEPRGSGAG ELGTDYYTTS AIGLDLTQPV FRYGLFLRLR
EADLQVAQAQ ARLNSAEQDL MVRVTERYFE VLAALDGVAF ARAELRAIER QLDQAEQRFE
VGMAPVTDVE EARARRDLSH ANLLQAETDL DSAREGLREL TGRSHRELAL LDEALPLESP
EPEDSEAWAR QAERQNWDLL AFRHGSEAAM ENIGVQRAEH LPTVDLVGNV QRVDQGSRTT
ATGVPRGPEE FDQASIGLRL NLPLYRGGAT RSRVREAQSQ YTTALEELEE TRRAVVRGAT
DAYRGVRSAI ARTAAFEQAI TSTQRALEAV EAGFEVGTRT VVELLDAQQD RLGAEQDFRQ
ANYDYLLQTV RLKRFAGTLS DADLATINTW LDTNAEVVPD I
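
The protein sequence corresponds to a translation of the gene sequence. A minimal sketch of that translation follows, assuming the amino acid assignments of the standard genetic code (translation table 11 uses the same assignments and differs in its permitted start codons) and assuming translation stops at the first stop codon. The function name and the codon table construction are illustrative, not taken from the database.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(raw: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the ten-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGCCTCTGC CCCGGCGAGT CGGCCGTCCG"))
```

For the first three blocks of the gene sequence this yields MPLPRRVGRP, which matches the first block of the protein sequence.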