Gene GM21_0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0066 
Symbol 
ID8135365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp83210 
End bp84868 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content65% 
IMG OID644867683 
ProductGeneral secretory system II protein E domain protein 
Protein accessionYP_003019911 
Protein GI253698722 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[N] Cell motility 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value7.36428e-24 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGGGC TTGTCAAGGA AGGATCCATC GGCGAGATCC TTTTCAAATC GCAGATCATC 
ACGGAGCACG AACTGAGGGC GGCGCTCGAA GCGCAGAAGG TCTCGGGATG CCGGGTGGGC
GAGGCGCTGG TCCGCCTGGG GGTGGTCACC CAGGAGGATA TCGACTGGGC GCTCGCCAAC
CAGCTGAACA TCCCCTACGT GCGGCTCAAG AAGGAAAACA TCGATCCCGC CGCGGTGGCG
AAGGTCCCGG GACAACTGGC CCGACGCTAC AGCCTCTGCC CCATCTTTCT CTCCGGCAAC
GAACTCTCCG TCGCCATGGC GGACCCCCTG AACAAGGAGG CGGTCGAGGA GATCACCCGG
GCGACAGGCT GCCAGATCAG CATCTCGGTG GGGCTCATCC GCGAGATCCG CGAGATGCAC
GACGCCATGT ACGGCCCGGA CCAGAACCTC CCGGAACTGG GGTTCAGCTC GGGGCACTTC
CCCGCCAAGG TCCTATCCGC CATCAACGCC GACCTCTCCG GCGCCATGCT CCTCAACCAC
CTCCTCTTGC GCGCAGTACA GCAGAAGTTC GTCTCGGTAG CGCTGCAGCC GCTGGGGGAC
CAGGTGCGGG TGCTGGCGCG CGGCGAAGGG AGGACGGCTG AGTTCGGCAA GCTCTCCGCG
ACCCACTACG GACGGCTCAC CGAGCGTATC CGCCGCCTCT CAGGCATCGA CGGCGCCGAG
GAGAACCCCT CCAGCGGCGT TTTGACCTTC ATCTGGCAGG GGAAAAGGAT CCCGTTCCAG
ACCCTGGCGA TGCCCGGCAA CGGGGGGGAT TACCTCACCT TGAAGCTGCA CGTCGGCGCG
CCGAAAATCT CCGAGCTGGA CGACCTCGGC GTCTCCGCCG CGAAACGCGC GGACCTGAAG
GCGCTCGCTT CCGAAAAGGA GGGGCTGATC CTCTTCACCG GGCGCGACCC GGAGGAGCGG
AGCCGGCTCA TGGACCTGTT CCTGGATGCC TGCGATCACG CCGACCGCAC CGTCATTTTG
GTGGGGGAAA GGCTTGGGCG CGGCAGGGAC CGGTGGCCGC GGCTTCCGGC AGGGAGATGC
GGCGCGGACG ATACCGCGAA GGTGGTGTCG GCGGCCTTGG AGCATGACCC GGACACGCTG
GTCATCGAGG ACGTCACCGA ACTGGCCTCC TTCATAGCGG CGAGCAAGGC GGTGATGCGG
GGGAAGCTCG TGGTGGCGGG GATGTCCCAG GGGAACAAGG GGGCGGTGTT GAAGCAGCTT
TTGTACCTCT CCCAGAAGAA CTTCCTGATA CCGACCCACC TGAAGGGGGT GGTTTCCTGC
AAGAGCGTGC TCCTCCTTTG CCCGGACTGC AAGAAGCGTT TCGCGCCTGC CGCCGACGAG
CTGGCGGCCT TGCGGCTTAG GGCGACGGCG CCCGAGTATT TCCGCCCGAC CGGCTGCCCC
TCCTGCGACC AGACAGGCTA TAGCGGCAAG AAATACCTCC TGGACGTGAT CCGATTCGAT
CAGGGGCTCC TGGAGGCGTT CGAGGTGATC CGCGATTCCG ACGAGATCAT CCGCCACCTC
AAAGACAACG GCTACCGCGG CATCGGCGAG GAAGGGGCCG AGCTGCTGGA GCGGGGAGAA
ATATCCCCGG GCGAGTACGT CGCTTCCATA CTACTGTAA
 
Protein sequence
MNGLVKEGSI GEILFKSQII TEHELRAALE AQKVSGCRVG EALVRLGVVT QEDIDWALAN 
QLNIPYVRLK KENIDPAAVA KVPGQLARRY SLCPIFLSGN ELSVAMADPL NKEAVEEITR
ATGCQISISV GLIREIREMH DAMYGPDQNL PELGFSSGHF PAKVLSAINA DLSGAMLLNH
LLLRAVQQKF VSVALQPLGD QVRVLARGEG RTAEFGKLSA THYGRLTERI RRLSGIDGAE
ENPSSGVLTF IWQGKRIPFQ TLAMPGNGGD YLTLKLHVGA PKISELDDLG VSAAKRADLK
ALASEKEGLI LFTGRDPEER SRLMDLFLDA CDHADRTVIL VGERLGRGRD RWPRLPAGRC
GADDTAKVVS AALEHDPDTL VIEDVTELAS FIAASKAVMR GKLVVAGMSQ GNKGAVLKQL
LYLSQKNFLI PTHLKGVVSC KSVLLLCPDC KKRFAPAADE LAALRLRATA PEYFRPTGCP
SCDQTGYSGK KYLLDVIRFD QGLLEAFEVI RDSDEIIRHL KDNGYRGIGE EGAELLERGE
ISPGEYVASI LL