Gene Mlg_2080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2080 
Symbol 
ID4269399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2358221 
End bp2359474 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content64% 
IMG OID638126836 
Producttype II secretion system protein 
Protein accessionYP_742912 
Protein GI114321229 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.917677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.551071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTCG CCACCGCAAA GAAAGCCAGT CGCAAGAAGC CCAGGCAGCA ACCCGTGTTC 
AACTGGGAGG GCACCGACAA GCGCGGCGCC AAGGTCAAGG GCAGCATGCA CTCGGAGAAT
GCCCTGGCGC TGAAGGCCGA GCTGCGCCGC CAGGGCATCA TCCCCTCCAA GGTGCGCAAG
CGCTCCGCGC TGGAGGACCT GCTCAGCGGC GGCAACAAGA AGATCAAGCC GGCGGACATC
GCCTATTTCA GCCGCCAGCT CGCCACCATG CTGCAGTCCG GCGTGCCGCT GGTTCAGGCC
CTGGATATCG TCGGCAAGGG GGACGAGCAC GCCGGCATGC GCCAGCTCGT GGCGGAGATC
AAGAACGATG TGGAGTCGGG CACGGCCCTG CACACCGCCC TACAGAAGCA CCCGCGCTAT
TTCGATGACC TGTTCGTCAG TCTGGTGGCG GCCGGGGAGT CCGCGGGGGT GTTGGACACC
CTGCTGGACA AGATCGCCAC CTACAAGGAA AAGACCGAGT CGATCAAGGG CAAGATCAAG
AAGGCCCTGT TCTACCCCAC GGCGGTGATC GTGGTGGCCA TCGTGGTCAC CGCCATCCTG
CTGATCTGGG TCGTGCCGCA GTTCGAGTCG CTGTTCCGCG GCTTCGGTGC CGACCTGCCG
TTGTTCACCC AGATGGTGAT CAACCTGTCG GACTTCATGC AGAGCTACTG GTTCATCATG
CTGGCCGCGG CCATCGGGCT GGGCTGGGGG TTCAGCACCG CCAAGCGACG ATCGAAGGCC
TTCTCACGCA GCGTGGACCG GTTTTCGCTG AAGATCCCTG CCATCGGCAA CATCCTTTAC
AAGGCCTCGG TGGCCCGCTT CGCCCGTACC CTCGCCACCA TGTTCGCCGC CGGGGTGCCC
CTGGTGGAGG GGCTGCGCTC GGTGGCCAGT GCCACCGGCA ACTATGTGTT CGAGTCAGCG
GTGCTGCAGA TTCGCGAGCA GGTGGCCGCC GGCCAGCAGC TGCAGATCTC CATGCGACTG
TCCAATCTCT TCCCCAATAT GGCCATCCAA ATGGTGGCCA TCGGCGAGGA GTCCGGCTCG
TTGGACAGCA TGCTCGCCAA GGTGGCCGAC TACTACGAGG AGGAGGTGGA CAACGCCATC
GATAGCCTCA GCAGCCTGCT GGAGCCGATG ATCATGGCGA TCCTCGGCAT CCTGGTGGGC
GGACTGGTCA TCGCCATGTA TCTGCCCATC TTCCAGATGG GCGCCGCCAT CTGA
 
Protein sequence
MAVATAKKAS RKKPRQQPVF NWEGTDKRGA KVKGSMHSEN ALALKAELRR QGIIPSKVRK 
RSALEDLLSG GNKKIKPADI AYFSRQLATM LQSGVPLVQA LDIVGKGDEH AGMRQLVAEI
KNDVESGTAL HTALQKHPRY FDDLFVSLVA AGESAGVLDT LLDKIATYKE KTESIKGKIK
KALFYPTAVI VVAIVVTAIL LIWVVPQFES LFRGFGADLP LFTQMVINLS DFMQSYWFIM
LAAAIGLGWG FSTAKRRSKA FSRSVDRFSL KIPAIGNILY KASVARFART LATMFAAGVP
LVEGLRSVAS ATGNYVFESA VLQIREQVAA GQQLQISMRL SNLFPNMAIQ MVAIGEESGS
LDSMLAKVAD YYEEEVDNAI DSLSSLLEPM IMAILGILVG GLVIAMYLPI FQMGAAI