Gene GM21_1658 details


Gene Information

Locus tag: GM21_1658
Symbol:
ID: 8136989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1928366
End bp: 1930027
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 55%
IMG OID: 644869271
Product: hypothetical protein
Protein accession: YP_003021471
Protein GI: 253700282
COG category:
COG ID:
TIGRFAM ID:
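
The gene length and protein length listed above can be cross-checked from the start and end coordinates. The sketch below is illustrative only and assumes 1-based inclusive coordinates, an unspliced CDS (as the record states), and a single terminal stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1928366
end_bp = 1930027

# Inclusive interval length in base pairs.
gene_length = end_bp - start_bp + 1

# One codon per three bases; the final codon is assumed to be the stop codon
# and therefore does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1662
print(protein_length)  # 553
```

Both values agree with the Gene Length (1662 bp) and Protein Length (553 aa) fields.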


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 177
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATGACG ATCGACACCC GCTTACCCTG AGAGAATACG GCGCCACTGT CGCCATGCTT 
CTCCTGGGAA TGTACTTTAT ACCTCTGTCG GTCATGCATC TCGACCTGTC CCGCATTCCC
GGCGACCTCG TGGACTCACG GCTAAACAAC TATTTTCTTG AGCACGGCTA CAGATGGGTG
ACGGGGCAGG TCGCGAGTTT CTGGAACGCT CCGTTCTTCT TCCCTGCGCC CCAGGTGATG
ACTCTTTCTG ACAATCACCT GGGCACGCTC CCCCTGTATT CGCTTTTTCG CTTACTTAAT
TTCGATCGAG AAACCGCCTA CCAGTTGTGG ATGCTGACTC TTTTCGTTCT CAACTATCTT
TGCGCGGTCG TTGTTTTGAA TCTGATGCGC TTCAACCCGG TGGGGGCCGC TGCGGGCGGG
TATTTCTTCA CCTTTTCGCT TCCCATGTCA GCCCAGCTCG GCCACATACA ATTGCTGCCC
CGCTTCATGA TCCCGCTGGC GTTCTATTTC GTACTCAGAT TTCTCCAGCA AAAGAGGAAC
CGCGACCTGG CGGCTGCCTG CGCATGCGTG GTAATGCAGT TTTATTGCGT CATCTATATC
GGCTACTTCC TGTCACTGGG GTTGGCTTTC TTCTTCGCAT CATATCTGAT CCTGGCCTGG
GACAGGATGG AGATCCGCGC CATCTTGTGG GGGTCGAGAA AGGAGTTCTC CGAGAGAGCG
CTGGCATTGG TAGGATCGGG AGCAGCCCTG ATGCCCCTTA TGTATCCCTA CTATCTGCGC
TCCATCTATT CCGAGCCAAT CTCCTGGGAC GTCATCTCCT CGATGCTGCC CAGGGCTTAC
TCCTACTTTT ACACCTTCAA CGAAAGCTTG ATGTGGAGCT GGCTGTCTGG GCTTGGTAAC
GCTCTACCAA TGGCGCATGA GCACCGCCTT TTCGTGGGGC TCCTTCCCCT TCTGGCCTTG
ATGCTGCCCC CCCTGCTTTG GATCAGGTCC CCCGATTGGC GGCTGACCTT GATGGGAAAG
CTGTTTTTGC TCACCCTGCT GGGAACCACC TTGTTCACCT TTTACGTCGA CGGTGTATCG
CTGTATCGGC TGATCTACTG GGCACCCGGT GTTAATGCCA TAAGGGCTCC GGTAAGAATC
ATCCTGATGC AGTCATTTCT AGTCGCCGTG ATGGTTGCTC TCGCCACAAC CTTCATGGCG
AGCCTGCTGA AGGATCGGCC TGAATGGGCG AAAATAATTC TGGCCGGCGC ACTGCTGATC
CTCATGGTCT GTGACCAAGG GTTGGTGCGT ACATACAAAT TGAGTTATGA CAAATCAGCT
GCACAAAGCC GTGCGCGGGC CCTCGAAGAA CTGGTGCTAG CGCGGGATCC AAAGGCAAAG
CTGTTCCTGT ATCTGCCATC GAACCCCCTT GAGGAGCGGG CAGAGAGGAA CCTCGATGCT
ATGTTGGCCG CGCAGCATAT GGGTATATAC ACCGTCAATG GCCACAGCGG ATACGAGCCC
CGTGACAACC GCCCCGATCC TCGCAAGCCG AACTACTGCT CACATCTGCG GGAGTGGTTC
GTGTCGGCGA AAGCCAATTA CCGTGAGCTG GACAACGTTG ACCTGCTTGA CCATCTCTTG
ATTGTTGGCG ATCTTTCCTG TATTGGAGGT GGAGCCGATT AG
 
Protein sequence
MHDDRHPLTL REYGATVAML LLGMYFIPLS VMHLDLSRIP GDLVDSRLNN YFLEHGYRWV 
TGQVASFWNA PFFFPAPQVM TLSDNHLGTL PLYSLFRLLN FDRETAYQLW MLTLFVLNYL
CAVVVLNLMR FNPVGAAAGG YFFTFSLPMS AQLGHIQLLP RFMIPLAFYF VLRFLQQKRN
RDLAAACACV VMQFYCVIYI GYFLSLGLAF FFASYLILAW DRMEIRAILW GSRKEFSERA
LALVGSGAAL MPLMYPYYLR SIYSEPISWD VISSMLPRAY SYFYTFNESL MWSWLSGLGN
ALPMAHEHRL FVGLLPLLAL MLPPLLWIRS PDWRLTLMGK LFLLTLLGTT LFTFYVDGVS
LYRLIYWAPG VNAIRAPVRI ILMQSFLVAV MVALATTFMA SLLKDRPEWA KIILAGALLI
LMVCDQGLVR TYKLSYDKSA AQSRARALEE LVLARDPKAK LFLYLPSNPL EERAERNLDA
MLAAQHMGIY TVNGHSGYEP RDNRPDPRKP NYCSHLREWF VSAKANYREL DNVDLLDHLL
IVGDLSCIGG GAD
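
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the pipeline that produced this record. It assumes the sequence is given in the coding orientation and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCATGACG ATCGACACCC GCTTACCCTG AGAGAATACG GCGCCACTGT CGCCATGCTT"
last_line = "ATTGTTGGCG ATCTTTCCTG TATTGGAGGT GGAGCCGATT AG"

print(translate(first_line))  # MHDDRHPLTLREYGATVAML
print(translate(last_line))   # IVGDLSCIGGGAD
```

The two outputs match the first 20 and last 13 residues of the protein sequence listed above. The last line can be translated on its own only because the preceding 27 full lines of 60 bases each keep it in frame.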