Gene Mlg_2161 details

Gene Information

Locus tag: Mlg_2161
Symbol:
ID: 4270156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2456424
End bp: 2457644
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 69%
IMG OID: 638126917
Product: pilus assembly protein CpaE
Protein accession: YP_742993
Protein GI: 114321310
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4963] Flp pilus assembly protein, ATPase CpaE
TIGRFAM ID:
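
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the record does not state its coordinate convention) and a CDS that ends in a single stop codon; the function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length = gene_length_bp(2456424, 2457644)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed Gene Length (1221 bp) and Protein Length (406 aa).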


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0776455
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0348695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAATG AGGTGGCGGC GACCCAGGGC GCCCATCGGT TCGTGGCCGC CCTGCCGGAG 
GGGCGGGCGC TGGAATGGCT GAAGCTCAGC CTGGGGGAGA TGGGCACGGT GGTGCCGGCG
GAGACCGGTA ACCTGGAGGA GATCCGCGGG GTGTTGGACC TGACCGACAC ACCGCTGCTC
TTCGTCTGGA TGGACCGCCA CAACCTGGCA CAGTCCGCAG CCCTGGTGGA GGGTATCCTC
GACGTCAAGT CCTTGATCAC CGTGATTGCG GTGGGGGAGG GGGTGCACCA GGACGAACTG
CTGGCGGCCA TGCGGGCGGG GGCGCGGGAC TTTCTCACCG TGGGCACCCG GGCCAGCGAG
GTGCGGGCCC TGATCCGCCG GGCCCTGGAC AAGGCCCCGG TGCAGCCCAG CGATGCCGCC
GACAAGGGGC GGGTCTGGGC GGTCATGAAC GCCCGGCCCA GCATGGCCAA CGCCTTTTTC
TGCACCCATT TGGCCCAGGC CATCCAGCGG GACAGCCGGG ATGCCCAGGT CCTGTTGCTG
GACCTGGCGA TCCCGCCGGC CGACTCCCTC GCCCTGCTCA ACCTCAAGTC CTCCTTCTCC
TTTTTCGATG CGGTCCGCAA TCTGAAGCGG CTGGACCGGA CCCTGCTGGT GAACGCCCTG
CCCACCCACG CCACGGGGCT GCAGGTGCTC TCCATGCCGG ACTCCTTCGA GGACGAGGAA
GAGGAGGTGA GCACCGCCGA GCTCTATCTG CTCCTGGGCT CGCTGAAGCG CTACTACAGC
CACCTGGTGG TGAACCTGGG TGGGTTGCCC GCCGGCGGGT TCCTGAATGT CATGCTGAGT
GGTGCGGACG AGGTGTTGCA GGTGGTGGAC CAGAGCATCC CCAGTTGCCA GCAGAACCTG
CGCCGGATCC GCCAGGTGGA GGACAGCGGG GTGCGCATCG AGTCTCGGCA TATCGTGGTG
GACCGTTACC AGCACCGGCA GGCCCCCAAG GCCGAAATGG TGGCCGACCG TATGGGCGCA
CCGCTGGCGG CGGTCCTGCG CACCGGGGAC GGTCAGCGGC TGCGGGCCAT CAACCTGGGC
AAGACCCTGC TGGAGCTGGC CCCCTCCGAC CCCTATGCGC GGGAGGTGCA GAGCCTGGCC
CGGCAGTTGC TGCAGGGCGA TGAGGTGAGG GCCCGCAAGG GCGGGCTGGC GCGGCTGAAG
CGGCTGCTGG GAGGCCGGTG A
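
The GC content field in the gene information can be recomputed from a sequence printed in this 10-base block layout. The sketch below is illustrative only: `gc_percent` is a hypothetical helper, it strips the spaces and line breaks used for formatting, and it makes no claim about how the database itself computes or rounds the reported figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for block formatting."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First 30 bases of the gene sequence above; pass the full sequence
# to compare the result against the record's GC content field.
print(round(gc_percent("ATGGCGAATG AGGTGGCGGC GACCCAGGGC"), 1))
```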
 
Protein sequence
MANEVAATQG AHRFVAALPE GRALEWLKLS LGEMGTVVPA ETGNLEEIRG VLDLTDTPLL 
FVWMDRHNLA QSAALVEGIL DVKSLITVIA VGEGVHQDEL LAAMRAGARD FLTVGTRASE
VRALIRRALD KAPVQPSDAA DKGRVWAVMN ARPSMANAFF CTHLAQAIQR DSRDAQVLLL
DLAIPPADSL ALLNLKSSFS FFDAVRNLKR LDRTLLVNAL PTHATGLQVL SMPDSFEDEE
EEVSTAELYL LLGSLKRYYS HLVVNLGGLP AGGFLNVMLS GADEVLQVVD QSIPSCQQNL
RRIRQVEDSG VRIESRHIVV DRYQHRQAPK AEMVADRMGA PLAAVLRTGD GQRLRAINLG
KTLLELAPSD PYAREVQSLA RQLLQGDEVR ARKGGLARLK RLLGGR
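
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal, dependency-free illustration, not the database's own pipeline. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle; the gene here starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard genetic code in NCBI ordering (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence, ignoring whitespace between blocks."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGGCGAATG AGGTGGCGGC GACCCAGGGC"))
print(translate("CGGCTGCTGG GAGGCCGGTG A"))
```

The two fragments translate to the first ten and last six residues of the protein sequence listed above (MANEVAATQG and RLLGGR).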