Gene GM21_3421 details

Gene Information

Locus tag: GM21_3421
Symbol:
ID: 8138788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3953747
End bp: 3956155
Gene Length: 2409 bp
Protein Length: 802 aa
Translation table: 11
GC content: 61%
IMG OID: 644871038
Product: outer membrane protein assembly complex, YaeT protein
Protein accession: YP_003023203
Protein GI: 253702014
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4775] Outer membrane protein/protective antigen OMA87
TIGRFAM ID: [TIGR03303] outer membrane protein assembly complex, YaeT protein
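
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based, inclusive positions on the replicon and that the coding sequence ends in a single stop codon; the helper names `gene_length_bp` and `protein_length_aa` are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3953747
END_BP = 3956155


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 2409
print(protein_length_aa(gene_len))  # 802
```

Both results match the Gene Length (2409 bp) and Protein Length (802 aa) fields.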


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 110
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCTTGCCG TTCAGCTCCT GGTGTGCCCG ACGCTGTCGT TCGCCCAAGG CGCGCACCAT 
GGTCCGTCCG CCGCGCCGGC CACGTCAGCC CAGCCCGTTC AGGCTGCCGC CCAGCCTGCG
CAACCGGCGC CTGCCCCGGC GCAACAGGCG GCAGCCCCCC AGTCCGGCGC CGAGAAAATC
ACCGCGGTCA AGATCAGCGG CAACCGGCGC ATAGAGACCG CCGCCATCGT GCCGGCGCTG
CAGGTGAAGG CCGGCGAGGT GCTGGACCCG GCGAAGGTTG ATGCGGATCT GAAGGCGATT
TACCAGTTGG GGTACTTCAA GGACGTGAAG GCGCAGACTG AAGCCGCGGA CGGGGGCGTG
GTCCTCGAGT ACGTCGTGCA GGAAAAGCCG ATAGTGCGCG AGGTGAAAGT CGAGGGGGCG
AAGGAGTTGA CCCCTGAGAA GGTGCGCGAC GCGGTGGAGA TCAAACCCAA CGCCATCTTC
TCCCAGAAAG ACCTGCAAAA GAGCGTCAAG AAGGTGAAAA AGCTCTACGC TGACGAGGGG
TACTACCTCG CCGAGGTGAC CGGGGACATA AGCATGCGCT CGGACACCGA GCTGAACGTC
ATCTTCCGCG TCAAGGAAGG GGACAAGGTC CTCATCAAGG AGATCCGCTT CGAGGGGAAC
AAGGCCTTCA AGGCGAAAAA GCTGAAAAAG GCCATGGAGA CCTCGGAGAA ATGGATATTC
TCCTGGCTCA CCGGCGCCGG TACCTACAAG GAAGAAGCCC TTAAAAACGA CATGGCGCTT
CTCACCGAAC TGTACATGAA CGACGGCTAC ATCAACGTGA AGATAGGGGA GCCGAAGGTG
GAGCTTACCC CGGACCGGAA AGCGCTCAGG GTCACCATCG GGATCACGGA AGGGGAGCAG
TACCGCATCG GCAAGCTCGA CTTCAAGGGA GACCTGCTGG AGAACCGGGA CGTACTCTTC
GGCAAGCTGA AGGAGAAGAG CGGGGACATC TTCAGCCGCA CCAGCCTGCG GGCTGACATC
TTCACCCTTA CCGACTTCTA CGCGGACAAG GGATACGCCT TCGCCAACGT CGCCCCGATC
ACCGAGTTGA ACCGGGAGCA GCGCATCATC GACATCAGCT TCGACCTCGA GAAGGGGGAG
AAGGTGAACA TCGACCGGAT CAACATCACC GGCAACACCA AGAGCCGCGA CAAGGTGGTG
CGGCGCGAGT TGAGGCTCGC CGAGGGAGAG CTTTACAGCT CCACCGCGCT CAAGCGCAGC
AAGCAGAACC TGATGAACAC CGGCTTCTTC GAGGAGGCCA ACCTGGTGAC GGCAAAGGGG
AGCGCCTCCA ACAAGCTCGA CCTGAACGTC GAAGTGAAGG AGAAGCCGAC CGGCACCTTC
AGCATCGGGG CGGGCTACAG CTCGCTGGAC GGCATCATAG GCCAAGGGTC GGTGCAGCAG
GCCAACTTCC TCGGCCTCGG GCTCAAGATG ACCGCGGCCG CCTCCTTCGG CAGCAAGTCA
CAGACCTACA ACCTCGGGCT CACCGATCCG TACTTCATGG ATACCAAGTG GACCATCGGG
GGCGATCTCT ACCGCAACGA GCGCCAGTAC CTCGACTACA CCCGCCGCGC TACAGGCGGC
GACATCAAGG CGGGGTACCC CCTCTCGGAT ACCTTGAGCA CCTTCTGGCT CTACAAGTAC
GAGCAGAAGG AGATCTTCGA CGAGTCGACC GAGCTGCAAA TCAACGTCAA CAAGGGGACC
ATCCTGGCCC CGGACAAGTC ATCGACCACC AGCGCCATCG TCGCCAGCAT CACCAGCAAC
ACCACGGACT ACCGGCTCGA TCCCACCACC GGGATGATGA ACACGCTCTC CGTAGAGTTC
GCCGGGCTCG GCGGCACCAA CCGCTACGTA AAGACCATCA CCGAGCATAC GCTGTTCCAT
CCTCTGCTCT TCGGGGTGGG ATCGGTGCGG GGAACGCTCG GGTACGTACA GGGGTTCGGC
GGCAAGGAAA TACCGATCGA CGAGAGGTTT TATCTCGGGG GCATCAGCTC GTTGCGCGGC
TACTCCTCGC GTACTGTAAG CCCGTACAAG ACCACCGAGA TTCATAACAT AGATGGCGGC
GGTTATGTAC CGGGGACCAG CTTAAGCCGC GTCTACCTGG GCGGCGAGAT GGAGGCGGTG
GCGAACGCCG AGTACACCTT TCCGCTGCTG AAAGAGGCGG GGCTTAAAGC CGTCCTCTTC
TTCGACGCGG GGAACTCTGC CAACAGCCTC AACGACACCT TCGGCAACAT CCTGACCAGC
TACGGCGCCG GCATCAGGTG GTTCTCCCCG ATCGGGCCGC TGAGACTCGA GTACGGCATC
CCCATCAACC CAAGACAAGG AATAGACTCA GGCGGAAAGT TGGAGTTCTC CATAGGGAGT
ATTTTCTAA
 
Protein sequence
MLAVQLLVCP TLSFAQGAHH GPSAAPATSA QPVQAAAQPA QPAPAPAQQA AAPQSGAEKI 
TAVKISGNRR IETAAIVPAL QVKAGEVLDP AKVDADLKAI YQLGYFKDVK AQTEAADGGV
VLEYVVQEKP IVREVKVEGA KELTPEKVRD AVEIKPNAIF SQKDLQKSVK KVKKLYADEG
YYLAEVTGDI SMRSDTELNV IFRVKEGDKV LIKEIRFEGN KAFKAKKLKK AMETSEKWIF
SWLTGAGTYK EEALKNDMAL LTELYMNDGY INVKIGEPKV ELTPDRKALR VTIGITEGEQ
YRIGKLDFKG DLLENRDVLF GKLKEKSGDI FSRTSLRADI FTLTDFYADK GYAFANVAPI
TELNREQRII DISFDLEKGE KVNIDRINIT GNTKSRDKVV RRELRLAEGE LYSSTALKRS
KQNLMNTGFF EEANLVTAKG SASNKLDLNV EVKEKPTGTF SIGAGYSSLD GIIGQGSVQQ
ANFLGLGLKM TAAASFGSKS QTYNLGLTDP YFMDTKWTIG GDLYRNERQY LDYTRRATGG
DIKAGYPLSD TLSTFWLYKY EQKEIFDEST ELQINVNKGT ILAPDKSSTT SAIVASITSN
TTDYRLDPTT GMMNTLSVEF AGLGGTNRYV KTITEHTLFH PLLFGVGSVR GTLGYVQGFG
GKEIPIDERF YLGGISSLRG YSSRTVSPYK TTEIHNIDGG GYVPGTSLSR VYLGGEMEAV
ANAEYTFPLL KEAGLKAVLF FDAGNSANSL NDTFGNILTS YGAGIRWFSP IGPLRLEYGI
PINPRQGIDS GGKLEFSIGS IF
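
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned, checked for GC content, and translated is given below. It assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the names `clean`, `gc_percent`, and `translate` are hypothetical helpers written for this illustration. Only the first line of the gene sequence is used here; pasting the full listing into `clean` would work the same way.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listing above.
first_line = "ATGCTTGCCG TTCAGCTCCT GGTGTGCCCG ACGCTGTCGT TCGCCCAAGG CGCGCACCAT"
dna = clean(first_line)
print(translate(dna))  # MLAVQLLVCPTLSFAQGAHH
```

The translated fragment agrees with the first twenty residues of the protein sequence listing.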