Gene Mlg_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2079 
Symbol 
ID4269398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2356460 
End bp2358196 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content66% 
IMG OID638126835 
Producttype IV-A pilus assembly ATPase PilB 
Protein accessionYP_742911 
Protein GI114321228 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02538] type IV-A pilus assembly ATPase PilB 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.292786 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACTG CGAATCCCAG CGTCAGGCTG AGCGGTCTGG CGGGACGCCT GGTCCGCGAG 
GGCCTGCTTG ATGAAGCGAC GGCCCGAAAG CACACCGAGC AGGCCGTGAA ACAGCGGCGC
CCACTGATCA GCCATCTGGT CGAGGCCAAG GTGCTGCCGG GCCATGAGGT CATCCAGCAG
GCTGCCATCG AGTTCGGCAT CCCGGCACTG GACCTGGCTG CCGTCGAGCC CGACATCAAT
GCCGTCAAGC TGGTTTCGGA GAAGCTGATC CGGACGCACC AGGCGCTGCC CCTGTTCCGC
CGTGGCAAGC GGTTGTTCCT GGGCGTGGTC GACCCCACCA ACCTCGATGC CCTGGACGAG
ATCAAGTTCC ACACGGGGCT GACCACCGAG GCCATCCTGG TGGAGCCGGA CAAGCTCACC
CAGGTGATGG AGCGGGCGCT GGATGCGGCG GCCGGGGGCG ATGCCGCGCT CAAGGAACTG
GACCTGGACG AGGACCTGGA GCACCTCTCG GTTGGCGGCG ATGAGCCCCG GGAAGAGCGG
GAGGTCGGCA TCGCCGGCGA GGGCGAGAAG GACGACGCCC CGGTGGTACG ATTCGTCAAC
AAGCTCCTGC TCGACGCCAT CCGCAAGGGC GCCTCGGACA TCCACTTCGA ACCCTTCGAA
AAGGAGTACC GGGTCCGTTT CCGCCAGGAC GGCATCCTCT ACGAGGTCTC CAAACCGCCG
GTGAACCTGG GTGGACGCCT GGCCGCGCGC CTGAAGGTGA TGTCGCGCAT GGACATCGCG
GAAAAGCGCG TACCCCAGGA TGGGCGTATC AAGATGAACA TCTCCCGCAA CCGGGCCATC
GACTTCCGGG TCAGTACCTG CCCCACGCTT TTCGGCGAAA AGGTGGTGCT GCGTATCCTC
GACCCCAGCA GTGCCCAGAT GGGGATCGAC TCCCTGGGCT ACGAGCCGGA GCAGAAACAG
CGCTACCTGG AGGCACTGAG CAAGCCCTAC GGGATGATCC TGGTCACCGG CCCCACCGGC
TCGGGCAAGA CGGTGTCGCT GTACACCGGC CTCCACATCC TCAACACCCC GGACCGGAAC
ATCTCCACCG CCGAGGACCC CTCCGAGATC AACATGCCCG GGGTCAACCA GGTCAACATC
AACCCCAAGG CCGGGCTCAC CTTTGCCAAT ACCCTGCGCG CCTTCCTGCG CCAGGATCCG
GACGTCATCA TGGTGGGTGA GATCCGTGAT CTGGAAACTG CCGAGATCGC CATCAAGGCG
GCCCAGACCG GGCACCTGGT GCTCTCCACC CTGCACACCA ATGACGCCCC GCAAACCCTG
ACCCGCCTGG CCAACATGGG TGTGCCCGCC TACAACATCG CCTCCTCGGT GACCCTGATC
ATCGCCCAAC GGCTAGCCCG CCGGCTCTGC AAGCACTGCA AGGTGCCGGA GGAGGTCCCC
CGCGAGACCC TGCTGGAGGA GGGGTTCACC GAAGCGGATC TGGAGGCCGG CGTCACGGTC
TACGCACCCC AGGGCTGCGA GCACTGCACC GAGGGCTACA AGGGCCGGGT GGGTATCTAC
CAGGTGATGC CCGTCTCCGA CGCCATGGGC CGCCTGATCA TGGAGGGCGG CAACGCCATG
CAGTTGGCCG ACCAGGCGGC CAGAGAGGGG ATTGATGACC TGCGCCGCTC CGGTCTGCGC
AAGGTCATTC AGGGGATGAC CAGCCTGCAG GAAGTCAACC GGGTCACCAA GGACTGA
 
Protein sequence
MVTANPSVRL SGLAGRLVRE GLLDEATARK HTEQAVKQRR PLISHLVEAK VLPGHEVIQQ 
AAIEFGIPAL DLAAVEPDIN AVKLVSEKLI RTHQALPLFR RGKRLFLGVV DPTNLDALDE
IKFHTGLTTE AILVEPDKLT QVMERALDAA AGGDAALKEL DLDEDLEHLS VGGDEPREER
EVGIAGEGEK DDAPVVRFVN KLLLDAIRKG ASDIHFEPFE KEYRVRFRQD GILYEVSKPP
VNLGGRLAAR LKVMSRMDIA EKRVPQDGRI KMNISRNRAI DFRVSTCPTL FGEKVVLRIL
DPSSAQMGID SLGYEPEQKQ RYLEALSKPY GMILVTGPTG SGKTVSLYTG LHILNTPDRN
ISTAEDPSEI NMPGVNQVNI NPKAGLTFAN TLRAFLRQDP DVIMVGEIRD LETAEIAIKA
AQTGHLVLST LHTNDAPQTL TRLANMGVPA YNIASSVTLI IAQRLARRLC KHCKVPEEVP
RETLLEEGFT EADLEAGVTV YAPQGCEHCT EGYKGRVGIY QVMPVSDAMG RLIMEGGNAM
QLADQAAREG IDDLRRSGLR KVIQGMTSLQ EVNRVTKD