Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2161 |
Symbol | |
ID | 4270156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2456424 |
End bp | 2457644 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638126917 |
Product | pilus assembly protein CpaE |
Protein accession | YP_742993 |
Protein GI | 114321310 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4963] Flp pilus assembly protein, ATPase CpaE |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0776455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0348695 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAATG AGGTGGCGGC GACCCAGGGC GCCCATCGGT TCGTGGCCGC CCTGCCGGAG GGGCGGGCGC TGGAATGGCT GAAGCTCAGC CTGGGGGAGA TGGGCACGGT GGTGCCGGCG GAGACCGGTA ACCTGGAGGA GATCCGCGGG GTGTTGGACC TGACCGACAC ACCGCTGCTC TTCGTCTGGA TGGACCGCCA CAACCTGGCA CAGTCCGCAG CCCTGGTGGA GGGTATCCTC GACGTCAAGT CCTTGATCAC CGTGATTGCG GTGGGGGAGG GGGTGCACCA GGACGAACTG CTGGCGGCCA TGCGGGCGGG GGCGCGGGAC TTTCTCACCG TGGGCACCCG GGCCAGCGAG GTGCGGGCCC TGATCCGCCG GGCCCTGGAC AAGGCCCCGG TGCAGCCCAG CGATGCCGCC GACAAGGGGC GGGTCTGGGC GGTCATGAAC GCCCGGCCCA GCATGGCCAA CGCCTTTTTC TGCACCCATT TGGCCCAGGC CATCCAGCGG GACAGCCGGG ATGCCCAGGT CCTGTTGCTG GACCTGGCGA TCCCGCCGGC CGACTCCCTC GCCCTGCTCA ACCTCAAGTC CTCCTTCTCC TTTTTCGATG CGGTCCGCAA TCTGAAGCGG CTGGACCGGA CCCTGCTGGT GAACGCCCTG CCCACCCACG CCACGGGGCT GCAGGTGCTC TCCATGCCGG ACTCCTTCGA GGACGAGGAA GAGGAGGTGA GCACCGCCGA GCTCTATCTG CTCCTGGGCT CGCTGAAGCG CTACTACAGC CACCTGGTGG TGAACCTGGG TGGGTTGCCC GCCGGCGGGT TCCTGAATGT CATGCTGAGT GGTGCGGACG AGGTGTTGCA GGTGGTGGAC CAGAGCATCC CCAGTTGCCA GCAGAACCTG CGCCGGATCC GCCAGGTGGA GGACAGCGGG GTGCGCATCG AGTCTCGGCA TATCGTGGTG GACCGTTACC AGCACCGGCA GGCCCCCAAG GCCGAAATGG TGGCCGACCG TATGGGCGCA CCGCTGGCGG CGGTCCTGCG CACCGGGGAC GGTCAGCGGC TGCGGGCCAT CAACCTGGGC AAGACCCTGC TGGAGCTGGC CCCCTCCGAC CCCTATGCGC GGGAGGTGCA GAGCCTGGCC CGGCAGTTGC TGCAGGGCGA TGAGGTGAGG GCCCGCAAGG GCGGGCTGGC GCGGCTGAAG CGGCTGCTGG GAGGCCGGTG A
|
Protein sequence | MANEVAATQG AHRFVAALPE GRALEWLKLS LGEMGTVVPA ETGNLEEIRG VLDLTDTPLL FVWMDRHNLA QSAALVEGIL DVKSLITVIA VGEGVHQDEL LAAMRAGARD FLTVGTRASE VRALIRRALD KAPVQPSDAA DKGRVWAVMN ARPSMANAFF CTHLAQAIQR DSRDAQVLLL DLAIPPADSL ALLNLKSSFS FFDAVRNLKR LDRTLLVNAL PTHATGLQVL SMPDSFEDEE EEVSTAELYL LLGSLKRYYS HLVVNLGGLP AGGFLNVMLS GADEVLQVVD QSIPSCQQNL RRIRQVEDSG VRIESRHIVV DRYQHRQAPK AEMVADRMGA PLAAVLRTGD GQRLRAINLG KTLLELAPSD PYAREVQSLA RQLLQGDEVR ARKGGLARLK RLLGGR
|
| |