Gene GM21_3056 details


Gene Information

Locus tag: GM21_3056
Symbol:
ID: 8138402
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3545343
End bp: 3546953
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 66%
IMG OID: 644870656
Product: Peptidoglycan-binding domain 1 protein
Protein accession: YP_003022842
Protein GI: 253701653
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3267] Type II secretory pathway, component ExeA (predicted ATPase)
TIGRFAM ID:
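
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence listed on this page does end in TAG). The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3545343, 3546953
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1611 536
```

Both results agree with the Gene Length and Protein Length fields in the record.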


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 101
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACTGGG AATCTTTCGG CTTCAAAGAG GCGCCTTTCG CGCTTACCCC CAACCCCTCG 
TTCCTGTTCC TAAGTTCTCC GCACCAGGAG GCGTTCGCGC ACCTGCTCTT CGCCATCGAG
AGCCGGGCCG GTTTCATCGA GCTTTCGGGC GAGGTGGGCA CCGGCAAGAC GACCATCGTG
CGCACGCTCT TGAACCAGCT CGACCCGGAG ACCCACCGCA CCGCGCTCAT CTTCAACCCT
ACGCTTTCGC CCCTGGGGCT TTTGCAGGAG GTCAACTCGG AGTTCGGGCT CGACTGCGTG
AGCCACGACA TGCGCGAGCT GCACACCACC CTGAACGCCT ATCTGCTGGA GGAGAACCGC
GCCGGCCGCA CCGTGGTGCT GGTGATCGAC GAGGCGCAGA ACCTCTCGGT CGAGGTGCTG
GAGCAGATCC GGCTCATCTC CAACCTGGAG ACGGACAGCG ACAAGCTGAT CCAGATCGTG
CTGGTGGGGC AGCCGGAACT GAACGCGCTT TTGTCGCGGG AGGAACTGAG GCAGCTGGAC
CAGCGCATCA CGGTGCGCTA TCACCTGAAG CCCATGTCTT TAGACGATAC CTGCGCCTAC
GTGAGGCACC GCATCAGGTT CGCCGCCGAC GGGCGGGAAC CGCTCACCTT CGCGCAAGGC
GCGTTCAAGA AGATCTTCAG CTTCTCCGGC GGCTTGCCGC GCCTCATCAA CGGGGTCTGC
GACCGTGCTC TGCTGCTCGC CTACACCAAG GAGTGCAAGG AGGTTTCAAC CGAAATGGCG
GCTCTTGCCA TAGCCGATCT GCGCAGGTCG CTTCCCCGCA GGGGGCGCGC GGTACCGATG
AAGGCCCTGT CCGCCGGGGT GGCGCTCTGC ATCCTGGCGA TCGCCGGCTT CTCGATGCTT
TCCGGGTCGC TGCTATCTAA GGAGCCTGCC GCCTTGCAGC CCCCTCCCGC ACTGTCGCGC
GAAGCGGCGC TAGCAGGTCT TGCGGCCTCC TCCGAACAGG AGAACCTGCT CGCCTCGGTG
AACGCGGTGC TGGCGGCCTG GCAGGCCCCG GCGGCGCTCG CCGCGCCGGG ACAGCCGGCG
ACGCTGCGCG GGCTCGCCCG TCAGCGCGGG ATGACGGCGA CCAAGGTGAC CGGGAACCTC
GACACGCTGG CGCGCCTGGA CGCCCCGGCG CTACTCCACC TAACGGTGCC GGGGGGAGGC
GAGAGGCTCG TCGCTCTCTT GAGGGTCGAC CGTGACGAGG TCGGCGTGGC GCCGGCCGTG
GCGGGAAAGA GGAGTCTCAC CCGGGCGGAA CTCGCGCAGA TATGGAGCGG CGGCGCCACC
TTGCTCTGGA AGGATTTCCA CGGCATCTTC TCGCGCGGCA AAGCTGGCGA GAAGGAGGCC
GGGGTGAGGC TTTTGCAGGG GCTCTTGAAA CAGGTCGGCT GCTACGACGG CGCGGTTAAC
GGCGAATTCA CCGACAAGAC CCAGGCGGCT GTCGCCGAAT TCCAGCGCCG GGAGCAGTTG
ACCGCGGACG GGAAGCTCGG GGGGCAGACC CTGATGATGC TGTATCGGCG CGCCGGAGGT
TTCTTTCCGC CGGGGCTGCA AACCTTGACC CACAACCAGG GGCGAATCTA G
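
The relationship between the gene sequence, its GC content, and its protein translation can be sketched as follows. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG). Only the first row of the sequence is used in the example to keep it short.

```python
# Standard codon assignments in TCAG order; translation table 11 uses
# the same amino acid assignments (assumption stated in the text above).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGTACTGGG AATCTTTCGG CTTCAAAGAG GCGCCTTTCG CGCTTACCCC CAACCCCTCG"
print(translate(first_row))  # MYWESFGFKEAPFALTPNPS
```

The 20 residues produced match the start of the protein sequence listed on this page. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked the same way.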
 
Protein sequence
MYWESFGFKE APFALTPNPS FLFLSSPHQE AFAHLLFAIE SRAGFIELSG EVGTGKTTIV 
RTLLNQLDPE THRTALIFNP TLSPLGLLQE VNSEFGLDCV SHDMRELHTT LNAYLLEENR
AGRTVVLVID EAQNLSVEVL EQIRLISNLE TDSDKLIQIV LVGQPELNAL LSREELRQLD
QRITVRYHLK PMSLDDTCAY VRHRIRFAAD GREPLTFAQG AFKKIFSFSG GLPRLINGVC
DRALLLAYTK ECKEVSTEMA ALAIADLRRS LPRRGRAVPM KALSAGVALC ILAIAGFSML
SGSLLSKEPA ALQPPPALSR EAALAGLAAS SEQENLLASV NAVLAAWQAP AALAAPGQPA
TLRGLARQRG MTATKVTGNL DTLARLDAPA LLHLTVPGGG ERLVALLRVD RDEVGVAPAV
AGKRSLTRAE LAQIWSGGAT LLWKDFHGIF SRGKAGEKEA GVRLLQGLLK QVGCYDGAVN
GEFTDKTQAA VAEFQRREQL TADGKLGGQT LMMLYRRAGG FFPPGLQTLT HNQGRI