Gene GM21_0891 details


Gene Information

Locus tag: GM21_0891
Symbol:
ID: 8136212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1062381
End bp: 1064045
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 59%
IMG OID: 644868507
Product: hypothetical protein
Protein accession: YP_003020716
Protein GI: 253699527
COG category:
COG ID:
TIGRFAM ID:
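The length fields in this record can be cross-checked against its coordinates. The short Python sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1062381
end_bp = 1064045

# Assumption: coordinates are inclusive, so the span is end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1665 554"
```

Both values match the Gene Length and Protein Length fields listed in the record.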


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 6.93749e-28
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGCTAA AAATGCTCCT TTTCCTCATC CTTGTTCCGG TGCTGGTAGC CGGTTGCGGC 
GGCGGCTCCA ACCAGAGCGG GACCGCGATC TCGAAACTTG CGGCGTCGCA GGGTTGCGTG
AGCCAGGACT GCCACGCCAC CTGGACCTCT CCGGGGTCAG GGGCCGTCAT TGCAGAGGAA
TGGCGCGCTT CCGCCCATAA CTTAAAGAAC GGGGCGGGAT GTGCCGACTG TCACGAGCCC
CACGCTGGGC ATCCGCAGTC CTGCTCCAAA TGCCACGGCG GCGGAAGCGG AGTAGCAATC
AGGAACCCCG ATGGTGCCGG CAAGTGCGGC AAGTGCCACG GGCTCTCCTA CCCGGGCGAC
GTCATGGTGG CGCAGGCGCC GCAGCACTTC GGCAACATTT CGGCGAGCGA ACTCAACACG
AAGTATAGGG CCTCGTACGT CAGCTCCCGT AATGTGAACA ACTGTAGGAA CTGTCACAAC
CCGCACAACC CGAGCGGCGC CATCACCATA GCGCGACAGT GGGCGCAAAG CGCGCACGGC
AATACCAAGG CCAAGGCTTA TGCGTACTAC GACTTCAAGA CGATGGGCAG CGCACAGCCA
TCATCAACCA CCTTCGAGTC GAACTGCGTC CGTTGCCACA CGACGACCGG CTATATTAAT
TTTGTCAATT CAGGCTTCGT CGACATCCAC GCCTGGGGCA GCGGCAGCGA CAAGACGAAG
GAAGTGACCG CCTGTAATGC CTGCCATGAT GACGGTGCCG GTCGCACCTA TGGCTACGGT
CTGCGCAACG TCGCGGTCGT CAGCATTTAC TACAACTACT CCTCATCAAA GAGTTCTCCA
ACGGTGAAGC TCAATAACAA CAAAACCTTG TACCCTGATG CAGGCGCATC CAACCTCTGC
ATGCCTTGCC ACACAGGCAG GGCTGTCGGA CAGATGATTA AGGACGCGGC TGCGCTTGGA
CTTAACTTTG CCAACGTCAA CATGCCGAAC GGCCATTACC GGTCCGCGGG GGCAACCGTT
TTTCAACTGG GGGGCTACGA GTTCGTCGGG AGAAGCTACT CCAACGCCTC CTTCCTTCAC
TCTTCAATCG GCCTTGGCAA CAACCGCGGC ACCGGCGGCA AAGGCCCCTG CATCACCTGC
CACATGACCA ACGGCACCTC GCACCTTTTC ATGCCGGTGA CACTGGATGA TGTCAAGGCC
GTCACCGGGG TGGTAAGCGC GACCTGCGTC AAGTGCCACG ACAGCAGCTT CCAAACCAGC
CACACGGCCG TTTCGCTGCA GGTTCGCAAG GCGGGGTATG GCGCAGCATT GGCCATGCTT
AACATCATCA AAACCGGCAA GTCGACCAGC ACCGACTGGG ATACTTTCGA CCCAGGCAAC
GGAGCCAACA CCATGGGGGC ATCTTTCAAC TACAACCTGC TTTCGAGTGA ACCGGGAGCC
TACGCCCATA ACCCGCTCTA CACCAAGCGG CTCATCTACG ATTCCATCGA CTGGATCTCT
AACGCGGGCA TGGACGACGA CGTGGCGGCC GCCATAAGTG CTGCGACACT TCCAGGCTCG
ATAACCAATC CGATAACCAA GATCGCTTAT ACTCCGGCGG AGGTTGCAGG GTTGAAGAGT
CTGGCCATCG CCTACCTTAG TGGAAGCGGC GGCGGGCGTC CCTAA
 
Protein sequence
MRLKMLLFLI LVPVLVAGCG GGSNQSGTAI SKLAASQGCV SQDCHATWTS PGSGAVIAEE 
WRASAHNLKN GAGCADCHEP HAGHPQSCSK CHGGGSGVAI RNPDGAGKCG KCHGLSYPGD
VMVAQAPQHF GNISASELNT KYRASYVSSR NVNNCRNCHN PHNPSGAITI ARQWAQSAHG
NTKAKAYAYY DFKTMGSAQP SSTTFESNCV RCHTTTGYIN FVNSGFVDIH AWGSGSDKTK
EVTACNACHD DGAGRTYGYG LRNVAVVSIY YNYSSSKSSP TVKLNNNKTL YPDAGASNLC
MPCHTGRAVG QMIKDAAALG LNFANVNMPN GHYRSAGATV FQLGGYEFVG RSYSNASFLH
SSIGLGNNRG TGGKGPCITC HMTNGTSHLF MPVTLDDVKA VTGVVSATCV KCHDSSFQTS
HTAVSLQVRK AGYGAALAML NIIKTGKSTS TDWDTFDPGN GANTMGASFN YNLLSSEPGA
YAHNPLYTKR LIYDSIDWIS NAGMDDDVAA AISAATLPGS ITNPITKIAY TPAEVAGLKS
LAIAYLSGSG GGRP
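
The GC content field can be recomputed from the gene sequence. The following is a minimal Python sketch, assuming the sequence is pasted as plain text in the space-separated 10-base blocks used in this record; `gc_content` is a hypothetical helper name, not part of any database tool.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks between 10-base blocks)
    is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example using only the first line of the gene sequence in this record.
first_line = "ATGCGGCTAA AAATGCTCCT TTTCCTCATC CTTGTTCCGG TGCTGGTAGC CGGTTGCGGC"
print(f"{gc_content(first_line):.2%}")
```

Passing the complete gene sequence instead of the first line gives the value to compare against the GC content field.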