Gene GM21_1682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1682 
Symbol 
ID8137013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1962396 
End bp1964165 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content64% 
IMG OID644869294 
ProductRhs element Vgr protein 
Protein accessionYP_003021494 
Protein GI253700305 
COG category[R] General function prediction only 
COG ID[COG3500] Phage protein D 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones110 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACTG AGTCGACCAT CCCCACGCCG GCGACGCCGG ACGTCTGCAC GATCGATCTC 
CTCGTCGAGG GAAGCGCGAT TCCCGGCGAG TACCACGTCC TTTCAGTTGC GGTGAGAAAG
GAGATCAACC GCATCCCGAC GGCTACCCTG GTGCTGCGCG ACGGGGAGGC AGCGAAGGCC
ACCTTCCAGG TCAGCAACAG CGATCATTTC CTACCAGGAA ACAAGGTGGA GATCAGGCTG
GGATACCGGT CGAACAACGA AACGGTGTTC AAGGGGGTGG TGATAAAGCA GGGGATCAGC
ATCCGCAAGA GCGGCAGCAT CCTGACGGTT GAGTGCCGCG ACGAGGCGGT GAAGATGACC
TGCGGCGCCA AGAGCCGCTA CTACACAGGC ATGAAGGACA GCGACATCTT GGAGCAGATC
ATCGCTTCGT ATCGGCTCGA CAAGGACGTG CAGGCGACCA AGCCCGACCT TAAGGAAGTG
ACGCAGTACA ACGCGACCGA CTGGGATTTC CTCCTCTGCC GGGCGGAGGC CAACGGCCAG
GTGGTGATCG TCAGCGACGG CAAGGTGAGC GTAACCCAGC CTGCCGCAAG CGAGGAACCG
GTCCTTTCGG TGGGGTACGG CAGCACCCTT TTGGAGCTGG ACGCGGAGAT AGACGCGCGC
AGGCAAAGCA CGGGGATCGT GGCCCGCAGT TGGAGCGGGA CGGACCAGGA CGTGCTGGAA
GCCGAGGCGA AGGAGCCGGC GAAGACCGTG GCGGGAAACC TGGCCCCGGA CACGCTGGCA
AAGGTTTTGG GGGGCGACCC CCACGAGATG AGGCACGAAG GCAAACTCAC CACCCCCGAA
CTGCAGGCGT GGGCCGACGG GCGGCTCCTC AGGGAGCGCC TGGCCAAGGT GCGCGGCAGG
GCGAAGTTCC AGGGGTTCGC CAAGGTGGCT CCGGGGAAGG TCATGGAGGT GAGCGGCATC
GGCGAGCGGT TCCAGGGGAG GTTCTACGTG GCGGGCGTGC GCCACGTGGT GGACAAGGGG
AACTGGGAGA CCGACGTGCA GTTCGGTTTG AGCACCGAGA CCTTCGCCGA GACCTTCGAC
CTCCGCCCGC TCCCCGCATC GGGGCTTCTT CCGGCGGTGA GCGGACTGCA GATGGGAGTG
GTGACGGTCC TGGAGAACGA CCCGCAGGGG GAGGACCGGA TCAAGGTCCG CCTGCCGCTG
GTGAACAAGG CGGAGGAAGG GCTCTGGGCG CGGCTGGCGA CGCTCGACGC TGGCAACAAG
AGAGGGACCT TTTTCCGCCC CGAGGTCGGC GACGAGGTGG TGGTCGGTTT CCTGGGGGAC
GACCCCTGCC ACCCGGTGGT GCTGGGGATG TGCCACAGCA GCGCGAAGCC CGCCCCGGAA
CCCGCCAAGG ACAAGAACCA CCGCAAAGGG TACGTCAGCC GGTCGAAGCT CAAGTTCACC
TTCGACGACC AGAACAAGGT GGTGCTCCTG GAGACGCCGG GCGGCAACAG GCTGGCGCTT
TCGGAGGCGG ACAAGGGGAT CGTCATCAAG GATCAAAACG GCAACAAGAT CATCCTCGAC
AACACCGGGG TGCGCATAGA GAGCAGCAAG GACCTGACAC TTAAGGCGGC GAAAAACGTG
AACATCGAGG CATCGGCCCG CCTGAATCTG AAGGCGCAGA CCTCCTTCAA GGCGGAGGGG
GCTGCCAGCG CAGAGGTCTC GGGCGCAAGC ACCACGGTCA AGGGAAGCGC CAAGACGGTG
ATTCAGGGGG GGATCGTGCA GATAAATTAG
 
Protein sequence
MSTESTIPTP ATPDVCTIDL LVEGSAIPGE YHVLSVAVRK EINRIPTATL VLRDGEAAKA 
TFQVSNSDHF LPGNKVEIRL GYRSNNETVF KGVVIKQGIS IRKSGSILTV ECRDEAVKMT
CGAKSRYYTG MKDSDILEQI IASYRLDKDV QATKPDLKEV TQYNATDWDF LLCRAEANGQ
VVIVSDGKVS VTQPAASEEP VLSVGYGSTL LELDAEIDAR RQSTGIVARS WSGTDQDVLE
AEAKEPAKTV AGNLAPDTLA KVLGGDPHEM RHEGKLTTPE LQAWADGRLL RERLAKVRGR
AKFQGFAKVA PGKVMEVSGI GERFQGRFYV AGVRHVVDKG NWETDVQFGL STETFAETFD
LRPLPASGLL PAVSGLQMGV VTVLENDPQG EDRIKVRLPL VNKAEEGLWA RLATLDAGNK
RGTFFRPEVG DEVVVGFLGD DPCHPVVLGM CHSSAKPAPE PAKDKNHRKG YVSRSKLKFT
FDDQNKVVLL ETPGGNRLAL SEADKGIVIK DQNGNKIILD NTGVRIESSK DLTLKAAKNV
NIEASARLNL KAQTSFKAEG AASAEVSGAS TTVKGSAKTV IQGGIVQIN