Gene GM21_1245 details


Gene Information

Locus tag: GM21_1245
Symbol:
ID: 8136570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1452863
End bp: 1455325
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table: 11
GC content: 50%
IMG OID: 644868859
Product: hypothetical protein
Protein accession: YP_003021064
Protein GI: 253699875
COG category:
COG ID:
TIGRFAM ID:
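
The gene and protein lengths listed above can be cross-checked from the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (the listed gene sequence ends in TGA, which is consistent with that). The function name `feature_lengths` is illustrative and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # both endpoints are counted
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop codon
    return gene_len, protein_len


# Coordinates copied from the record for GM21_1245.
gene_len, protein_len = feature_lengths(1452863, 1455325)
print(gene_len, protein_len)  # 2463 820
```

Both values agree with the Gene Length (2463 bp) and Protein Length (820 aa) fields of the record.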


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 7.162309999999999e-20
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCAAAAA AGAAATCGGG TGGCGGTGGA GCTGCACTAA TCGGCCTGAT TGTCTTAGGC 
ATGATAGTGA AGTATTGGTC TGTGTTTTTC CCCTTGACTG TTCTCGGTCT GATAATTTGG
GGCATAGTAA AATTGTCAAA GAATTGGTCC ACAGCTGACT CGAAATCAAG CCCCTCGCAG
ACAGCATTAG ACCTGACGCG GCCAGAAAAG ACAGTAGTTC CCCCAGCTAA GACAGCCACC
CCAGTCCCGA CCATAAAAAT CGAAGTAAGT ACGAGTACGA ATTACCAGTC CCCGTCATCA
TCTCCGAAGG AGTCGCCCTA CGTGCCACCA TATCGGACGG ATGCCACCAT TTTCGGGGCA
AGTGCCCAGC AATCAAATTC TGTATCAGCT GACTCCTTTT GGGTTCCATG TGGTCGCACG
ATTCAGGTTG CAGGCTATTC GATTCCGGGC GGGATGGTTT ATCATGGCAC TGGGTTGAAG
TCCGTAAATC AGTACAACGA TGAACCGGCT CTGATAAGAC CGAAGCTTAA ACTAGATTCA
GCCAACCCCG ACCGGGAAGG TCGTAACATA GGGTATTGGC CATGCTATTC TCAGATTCAC
CCCACATCAC GAGCTGCTTT CCTGGAGTGG CTTTCCACTG GACGAAAAGA CCCCAACACC
AACATCGGCT ACGTCTTCAT CTTTTTCTAT GGCCTTGAAA GACGAGCCTT TATCGATGCC
AAGGAATCTG CCGCAGCACG TAACGAAATC CCCACTATTG CAACGGAAGT AAAACGACTC
CTTTCTATCT ACGGCGAAAA CAACAGCTTC CGTGGATACG CAAGCAAGTT TCTGGACGCA
ATACAGTCGT CCCAAGTGAA GGCACATCTC TACCGTACAG CGCCACAGAT TGACAATGGA
TGCTCCTGGG AGATTCCGCT GACCCTTAAG ATAGCCCTTG GTCAGGTAGC CAATGATGGT
GTTCCTCTGC CTGCCGAATG GGCATTGGCG TGGGCAGAAA ACGACCCGTC AATGCCTCGC
AGAATGCCAA GCCAGCGTTG TCAGGCCGAG TTCCGCGAGC TATTCAAGAC TCGCTACAGC
GAAAAAATGG GAGAAGGCTT GAAACTGAAG CCAAACAAAA CGAGGCTCAA GGCAAGTTAC
TTCCCGGCCA CTTCATCTTT TAGGGGGAAT ATTGAAATCC CGATTTTGGA TTTGCCGGAT
GTTACAGAGA CTACCGGCCC AGCTAACAAG ATTCGGGACA TTGCCAATGC CTGTACTGAC
GAACTGGAAA GTTATAGCCG TTATCTTGGC CGGAACCCGG AAGGAAGGAA CTCGATTGAG
GCCACGTCAT ACCTTCCCCA ACCCCTGTTG ACCAAACATG CCGGAAAAGA CTTCCAGAAG
CTGAGTGACT GGCTGTCTGT GCAGGTTCAC GCCGATAAGC CCGAGTGCTT TTCCTTCTCA
ATGCTGTTGG AGCACATTTC GTCAATCAAG CCTGATGGTT TCGGCAAGAA AGAGGCAACT
GCTATCGCTA ACCTGCTGGC CAAGATGAAA ATCGGTATTG AACCTGACCC ACGCTTTGGC
AATTTCATTC CTAAAATCGG CCAGGATGTC GTTCTGTTCA AAATCAGTGA CAACGCCCCA
AGTTCTCCTT CAACCGAGTT TTCTGCCGCT GCCGTCGTGC TCCACTTGGC TTCAGCTGTC
GCCAATGCCG ATGGCTTCAC TGACTCTACA GAAGAGCGTC ATCTGGAAGA GCATGTCGAA
ACATGGCTGC ATCTGTCACC GGACGAAAAG ACAAGACTAC GGGCTCATAC TCAGTGGTTA
CTTTCTGCCT TTCCCGGCAT GAATGGGGTC AAGAAACGGA TTGAAGTTCT CAAGCAAGAA
CAGAAGGAAT CTTTAGGACG ATTCCTTGTC GGAGTCGCGC AGGCCGATGG CTATATCGAC
CCCACAGAAA TGAAGACCCT CACGAAGATT TACGAAATGC TCAGCTTGGA TACTCAGAGC
CTTTACAGCC ATGCCCATGC TGCTGCTGTG GAACCTGTAA CAGTACAAAC CTCCGATTTC
GTGAAGCCGC AGGGCTACGC TATCCCAACA CCTCCTCCTA AACCTTGTGA GGGCGTATCT
CTTGATATGA GCGCCATCGA AGCGAAACTT GCTGAAACCG TTGCAGTGTC GGCTATTCTG
AGAAATATCT TTACGGATGA TGAGCCGGTC GCAACTCAGT CATCAGGTAC TGTGGTAACT
ACACCGGAGG TTTCCGTTGC TGGACTCGAC CCTGAATCAT TTACCTTCAT GCAAGTATTG
GCTTCCAAGC TTGTCTGGGC CAGGGAAGAG CTGGAAGAAC TTGCTGCAGA CCATAGCCTG
ATGCTAGACG GCACACTCGA CACCATCAAC GATGCATCGT TTGACCATTT TGGTGGGCCG
TTCTTCGAGG GTGACGACCC CATAGAAATC AATGCTGAAT ATGCCAAGGA GATATCCGCA
TGA
 
Protein sequence
MSKKKSGGGG AALIGLIVLG MIVKYWSVFF PLTVLGLIIW GIVKLSKNWS TADSKSSPSQ 
TALDLTRPEK TVVPPAKTAT PVPTIKIEVS TSTNYQSPSS SPKESPYVPP YRTDATIFGA
SAQQSNSVSA DSFWVPCGRT IQVAGYSIPG GMVYHGTGLK SVNQYNDEPA LIRPKLKLDS
ANPDREGRNI GYWPCYSQIH PTSRAAFLEW LSTGRKDPNT NIGYVFIFFY GLERRAFIDA
KESAAARNEI PTIATEVKRL LSIYGENNSF RGYASKFLDA IQSSQVKAHL YRTAPQIDNG
CSWEIPLTLK IALGQVANDG VPLPAEWALA WAENDPSMPR RMPSQRCQAE FRELFKTRYS
EKMGEGLKLK PNKTRLKASY FPATSSFRGN IEIPILDLPD VTETTGPANK IRDIANACTD
ELESYSRYLG RNPEGRNSIE ATSYLPQPLL TKHAGKDFQK LSDWLSVQVH ADKPECFSFS
MLLEHISSIK PDGFGKKEAT AIANLLAKMK IGIEPDPRFG NFIPKIGQDV VLFKISDNAP
SSPSTEFSAA AVVLHLASAV ANADGFTDST EERHLEEHVE TWLHLSPDEK TRLRAHTQWL
LSAFPGMNGV KKRIEVLKQE QKESLGRFLV GVAQADGYID PTEMKTLTKI YEMLSLDTQS
LYSHAHAAAV EPVTVQTSDF VKPQGYAIPT PPPKPCEGVS LDMSAIEAKL AETVAVSAIL
RNIFTDDEPV ATQSSGTVVT TPEVSVAGLD PESFTFMQVL ASKLVWAREE LEELAADHSL
MLDGTLDTIN DASFDHFGGP FFEGDDPIEI NAEYAKEISA
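
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotides codon by codon. The sketch below is a hedged illustration, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. This gene starts with ATG, so that difference does not matter here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the whitespace stripping assumes the 10-base grouping shown in the listing.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record.
first_line = "ATGTCAAAAA AGAAATCGGG TGGCGGTGGA GCTGCACTAA TCGGCCTGAT TGTCTTAGGC"
print(translate(first_line))  # MSKKKSGGGGAALIGLIVLG
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.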