Gene GM21_1523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1523 
Symbol 
ID8136852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1784443 
End bp1785681 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content67% 
IMG OID644869135 
Producthypothetical protein 
Protein accessionYP_003021337 
Protein GI253700148 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value5.03932e-19 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGCGCCC GGGACCGCTT CAGTAAGCTC GGGGGGATCG CCCTGGAGAC GGTGGGATCG 
CTGCTTCTGG TCCTCGCGCT CTACTGGCTG TTCATGGTCT TTCTCTACGC ACTTTTCCCC
TCAGGCACGC CGCTGAAGGA GATGCTGGCG AATCGCGCGG AAGAGCTTCC GGTGAAGGAC
GGCGCGGGGC GGCGGCCGGA GGCGGCGCTC AGATCCCTGG TGCGCGACGT CCGCTTCAGG
CGCGGCAACT CAGTCGCCTG GGAGGGAGCC AGGGAAGGGA TGCTTCTTTA CAACAACGAC
GCCGTGCAGA CCTTCGACCG CTCCGGCGCC ACCTTGTCCT TCGCCCCCAG CGACCGCCTC
ACCGTGGGGA GCAATTCCCT GGTGCTGGTC ACCCGCCTGA ACGAAAAGGT GGAGGGGGAG
CCTAGGGCCT ACCGGGTCCA GGTGGAAGGG GAGCTGCAGG GAAGCCTTTC CGACGAGAAG
CGACTGCGGC TGGAGATCGC CACGGCGGGG CATCTGGCCC GCGTTGCTTC CGGCGCGGCC
AGGTTCAGCG TCACCCCCAA CGGCAACGCG TCGAGCCTCG CCGTCTACGC CGGGGAGGTC
TGGGTCCAGG GCAGGGACGG GATCGTCCGC GTGCCGGCCT ATCACGGCAT CACCCTGAGA
AAAGGGGTCG CGGCGGGGCC GGCGGTGCCG CTTCCCGAGG CGCCGTCGCT CAAAACGGAG
AAGCTCCTCT ACAGATTCCG CCTGCTTCCG CCCAAGGTCC GTTTCTCCTG GAGCGGCAAG
TCGGGCGAGT ACCACTTCCA GCTGGCGAGA GACCCCCGCT TCAAGAGCCT GGTGCTGGAT
AAGAAGCTCG CCGCGCCGGA ACTCGTCACC GGAACGCTGG AAGCCGGAAG CTACTTCTGG
CGGGTGAGCG GGGTCATGGA GGCCAGGGAA GGGTTCTTCA GCCGCACCGG GCGCTGCGAC
CTGCTGCAGC TTTTGAAACC GCCCGAACTC AAGGTGGAAT TCCCGCCTCA AAGCGCCGCT
GCCGGAAACT TCACCTTGAC CGGCAGCGTC GAGCCCGGCG CGCGGGTCTT CGTGAACGGC
GTCGAGGTGT CCGGCGCCGG CGACGGGGCT TTTGCCCACG ATCTCAGGCT GAAAAGCGGC
GTCAACCTGA TCAGGGTGGA GGCCGTCGAC CAGGCGGGAA ACGCCAGTTA CGCCTCCCGG
GTCGTCTACG GGGCAGGTGC CGGACAGCAA GACAGATAG
 
Protein sequence
MSARDRFSKL GGIALETVGS LLLVLALYWL FMVFLYALFP SGTPLKEMLA NRAEELPVKD 
GAGRRPEAAL RSLVRDVRFR RGNSVAWEGA REGMLLYNND AVQTFDRSGA TLSFAPSDRL
TVGSNSLVLV TRLNEKVEGE PRAYRVQVEG ELQGSLSDEK RLRLEIATAG HLARVASGAA
RFSVTPNGNA SSLAVYAGEV WVQGRDGIVR VPAYHGITLR KGVAAGPAVP LPEAPSLKTE
KLLYRFRLLP PKVRFSWSGK SGEYHFQLAR DPRFKSLVLD KKLAAPELVT GTLEAGSYFW
RVSGVMEARE GFFSRTGRCD LLQLLKPPEL KVEFPPQSAA AGNFTLTGSV EPGARVFVNG
VEVSGAGDGA FAHDLRLKSG VNLIRVEAVD QAGNASYASR VVYGAGAGQQ DR