Gene GM21_3565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3565 
Symbol 
ID8138937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4140021 
End bp4143353 
Gene Length3333 bp 
Protein Length1110 aa 
Translation table11 
GC content64% 
IMG OID644871184 
Producthypothetical protein 
Protein accessionYP_003023344 
Protein GI253702155 
COG category 
COG ID 
TIGRFAM ID[TIGR01905] doubled CXXCH domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones121 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAA AAATGGTCAT GATCCTAGTG TTGCTTCTGG CCATGCCGGC CTTTGCCAAC 
GCATGGTATG TCAATGCGAA GACTTCGCCG ACCAGCGGTG CGGGCACCAT CACCCCGTCG
GGCAACAGGA CCTATGCCGC GGGGGTGAAC AGCGAGGAGT TCACCGTCAC TCCGGCTCCG
GGCTACACCC TTTCCCGCGT CACGCTCGAC GGCGTCGCCA TCGCCCCCAA CGCAAACGGG
AAGTACGTCG CACCTTACGT CTCGACCCTG ACCTGGCGCT ACATGGTGGC AGTCTTCTCG
GCCGGCACGG TGAACATCAC TACGAGCGTT ACCAGTGGGG GCGCCATTCG CGAGGCCAAC
TATCTCTCGC TCACCTCCAT CCCGGTCGGT TCCACCCGGA CCCTCTTGGT ACTTCCGAAC
AGCGGCTACG AAATCAGCAC CTTGACCGCC AGCGGCACCC CTTCCGTAAC CGTCCAGGGG
GATGGCTCCC GGTTGGTCAC CTACAACAAC CTGCAGGCGA ACCAGAGCGT CACTGCCGGC
TTCTCGCTGA TCCCGATCGT GGTCGCCAAC GCCGGCAGCG ACGTCACCAC TACGGGCACA
GGCGCGGCTT ACGCGACGAC CCTCTTCGGC AGCAACAGCA GCAGCAACCA GGGGGCGATC
AACTACCAGT GGAGCGGACC GGCGGCTTTG ATGTTCGGCT CGCCCACTTC CGCCAATACC
ACCGTATATT CGGACATCCC CGGGGATTAC ACGGCGACAC TCACCATCAC CTCGAACGGC
ATCACCCGTT CCGATACGGC CATCGTGCAT GTCGTCACTC GGAACTCCTA TCTCGTGTCT
CAGTGCACCA CGTGCCATTC CGGAAACACC ACGGCGCTGG TCGGCCTTTA CAACAGCTCG
CCGCATCTTG AAACCAACGC CTGCCAGGGC TGCCACACCG ACAGCCCGCA CGTGGCGCTG
CCGTCGCCCA ACGTCTGCGC CGATTGCCAC ACGGACACCT CGCGCCATCC CTTCGAGATC
ACCGACACCT GCACCTCCTG CCACAACTCG CACTCGACCG TTGTCGGGGC GACGGCGATG
CCGGACGGTT TGCATTACAA CAACATCACC ACCGGCATGT ACCCCGCCTC CTTCGTGACC
TCGCGCGCCG CCTGCGCCAA CTGCCACAAC AACACGATTT CCAACAAGGC GATCAGGATA
CAGTGGCGCG CGGCGCGCCA CGCTAACATC ACCTCCATGG GGTGGATCGC CAGGGACTTC
AAGACCCTTA ACGGCTGCGT GCGCTGCCAC ACCACCACCG GCTTCATCGC CTACTCCACG
GGCAAGGTGA CGGCGGCGTG GGGCGTGGCC GCAGACAAGA CCAAGGAAGT ACTTACCTGC
ATCGGCTGCC ACAGCGACAG CGTGTCCGGG GCGGTGCGCA CCATGGCGCA GGTTGCGCCG
TACCCCGACA ACAGCTTCGT CACCCCCGAC ACCGGCAAGT CCAACGTCTG CCTCCCCTGC
CACTCGGGCA CCAGCAGCGG CGAGAGCATC AAGGCGCTTT TGCAGGCCCA GGCAGACTTC
GGCAACACAG GCTTCGTGAA CCCGCACTAC AAGGCGGCCA CCGGCTCCCT CTACGGCGTG
GCCGGCTACC ACTTCTCGGG GCGCAGCTAC ACCACCGAAG CGACCCACAA CCATCTGGGC
ATCAGCGACG GCGGCGGGGC CTGCGTCTCC TGCCACAGAA ACAGCATGAA CGGCCACACC
TTCCAGGGTG AGGTGACCCC GGTCTGCGCC ACCTGCCACG GCACCAGTCT GGACGAGGCC
TCCCTGCACG TGGACCACAA CTACTTCCTG AACGCCCTCG AGGTGCTGAG GGCTCAGTTG
GCCGCCAAAG GCTACGCATA TTCGCTCACC AGCCGCAGCT TCAACGCCAC CAACTGGGGC
GCCGGTCAGG GGGGCGCCGA CACAATGGGC GCCGCGTTCA ACTACGCGCT GCTCGTCTCC
GAGCAGGCCG CCTTCGTGCA CAACCCGAAA TACGCCCAGG AACTGGTGAT CGATTCCATC
GATTACCTGG ACAACGGCCA GTTCGACGAT TCAGTGGCCG GCACCGTGCA GGCCCTGCTT
GACTCGGGAG CGATCAGCCA GGAGGTCGCC GACAGTTTCG GCACCTACAA GCAGAAGAAC
ATCTGCCTCT CCTGCCACGG CGGCGACGCG ACCACCTCGC GCCCCATGGC CAGCAACGGT
CATCCTTCGC ACCTGAGCGG GGCCTGGGGG CCGCAGGATT ACCTGCGCAC GCAGAGCTCG
AGCTGCGAGC CCTGCCACGG CAACGACTTC GCCCTGCACT CCAACGGCAC CGTCAACGTG
TCGAGCGACG CGGGCTCCTC CTGCGCAAAC TGCCATGCCG GTTCGGTTCC CGCCTGGAAC
TCCACGGCCC GGATCGCCTG CGAGGTCTGC CACTCGGCGA ACCCGGCAAG GCTTCCCAAC
GGAGTGGCGG CGCCTTCCAA GGAGAGTTTC GCCACCTCCG GCCACGGGCA GTTCGGCGCC
AGCAACCAGT GCACCATCTG CCACAACCCC GACAGCCGCC ACATCTCCGG CAGCCTCGGA
ACCTACAAGA GGCTGAAGCT GCAAAACGAC AACAACCTCT GCGCCTCCTG CCACGACAGC
GTGGTGGCGC AGCAGATGCC GACTCACCAT GGCCTTGCCT GCGTGCAGTG CCACGATCCG
CACGGAAACG GCAACATCAA GATGGTACGG GGCACGATCG GGACGCAGAG CATCACCTAC
CTGAACTCCT TGAACAACTT CGTGGACCAG ACTACCAACA AGGGGCTTTG CCAGGTCTGC
CACAGCTCCA CCCGCTACTA CCGCGCGGGG ATCAGCGAAA GCAGGCACTA CACTACTGGG
TGCCTTGGCT GCCACTTCCA CTACAACCCC GACGGCGCCT TCTTGCCCAG CGGCGGCGCC
TGCGACTCCT GCCACGGTTA TCCGCCCGCT CCGAAGAACA CGGCGACCAG CTTTGGCAGC
GACTCCAACT GGGCCAACGC TCGCTACGAG GACTATTCCG GAGGCGGCGG CGCGCACCTG
GTGGCGGCCC ACGTCTCCCC CTTCGCCACG ACGACCGACG GCTGGAGCAA CTGCACCGTC
TGCCACAACG GCGGCTACCA TGAAATGACC ACACCGGTGG CGGAACACAT CGGCAACGTC
ACCGTGATGG TGGACAACAG CCTGCGCTTT GCCGACAGCT TCACGGTCTA CACCGGCGCG
AAGCTGACCA ACGCGGGGCC GAACGCCACG GGAAGCTGCT TCAACATCGC CTGCCACATG
AGCCCGTCAA TAAGGTGGAG TACGGAAAGA TAG
 
Protein sequence
MFKKMVMILV LLLAMPAFAN AWYVNAKTSP TSGAGTITPS GNRTYAAGVN SEEFTVTPAP 
GYTLSRVTLD GVAIAPNANG KYVAPYVSTL TWRYMVAVFS AGTVNITTSV TSGGAIREAN
YLSLTSIPVG STRTLLVLPN SGYEISTLTA SGTPSVTVQG DGSRLVTYNN LQANQSVTAG
FSLIPIVVAN AGSDVTTTGT GAAYATTLFG SNSSSNQGAI NYQWSGPAAL MFGSPTSANT
TVYSDIPGDY TATLTITSNG ITRSDTAIVH VVTRNSYLVS QCTTCHSGNT TALVGLYNSS
PHLETNACQG CHTDSPHVAL PSPNVCADCH TDTSRHPFEI TDTCTSCHNS HSTVVGATAM
PDGLHYNNIT TGMYPASFVT SRAACANCHN NTISNKAIRI QWRAARHANI TSMGWIARDF
KTLNGCVRCH TTTGFIAYST GKVTAAWGVA ADKTKEVLTC IGCHSDSVSG AVRTMAQVAP
YPDNSFVTPD TGKSNVCLPC HSGTSSGESI KALLQAQADF GNTGFVNPHY KAATGSLYGV
AGYHFSGRSY TTEATHNHLG ISDGGGACVS CHRNSMNGHT FQGEVTPVCA TCHGTSLDEA
SLHVDHNYFL NALEVLRAQL AAKGYAYSLT SRSFNATNWG AGQGGADTMG AAFNYALLVS
EQAAFVHNPK YAQELVIDSI DYLDNGQFDD SVAGTVQALL DSGAISQEVA DSFGTYKQKN
ICLSCHGGDA TTSRPMASNG HPSHLSGAWG PQDYLRTQSS SCEPCHGNDF ALHSNGTVNV
SSDAGSSCAN CHAGSVPAWN STARIACEVC HSANPARLPN GVAAPSKESF ATSGHGQFGA
SNQCTICHNP DSRHISGSLG TYKRLKLQND NNLCASCHDS VVAQQMPTHH GLACVQCHDP
HGNGNIKMVR GTIGTQSITY LNSLNNFVDQ TTNKGLCQVC HSSTRYYRAG ISESRHYTTG
CLGCHFHYNP DGAFLPSGGA CDSCHGYPPA PKNTATSFGS DSNWANARYE DYSGGGGAHL
VAAHVSPFAT TTDGWSNCTV CHNGGYHEMT TPVAEHIGNV TVMVDNSLRF ADSFTVYTGA
KLTNAGPNAT GSCFNIACHM SPSIRWSTER