Gene GM21_3117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3117 
Symbol 
ID8138467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3617096 
End bp3618955 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content65% 
IMG OID644870721 
ProductTonB-dependent receptor plug 
Protein accessionYP_003022903 
Protein GI253701714 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones158 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGACGTA ATTTCATAAA CAGATTGCGG TTTCCATGCG CGCTGCTCTG CGCCGTCCTG 
GCCGCGGCTC CCGCGCTGGG GTCGGGGCCG GATGAATTGA GCCGTTCCTT CGGCCTCTCC
GACCCACCTC CCGAACTTGC GCGCTTTCCC CGCCCTGCCT CCCGCATCGC CGAAAACGTC
ACCGTGGTCA CGTCGGAAGA TATCGCCCGC ATCAACGCCC ACACCGTCGC CGAGGTGCTG
CATACCCTCC CGGGAATCCA GATGGACCAG TTGCAGACGC CGGGGTCGAT CAACTTCTTC
ACGGTCCTGG GGGCATCGAG CCGGCACGTC CTGGTGCAGA TCGACGGGGT CGCTCAGAAT
TTCATCGGCT CGGAGAACAT AGCCCAGGTA GGGCTTATCC CGGTGCAGAT GGTCGAGCGG
ATCGAGGTGG TGAAGGGAGC GGCGTCCGCC GCCTGGGGTT CGGCGCTCGG CGGGGTGATC
AACATCGTCA CCAAGAGCCC TGCCGCGGAC CGCGCCGGCA TGGTTTCGGG GTCCGCCGGC
AAGAGCGCCA CCCGTGACCT GCGCGCCGAG GCGACCGGCA GCGGGGAGCG CTTCGGCTAC
TACCTGACCG CCGGCAAGCT CCGCTCCGAC GGCTTGGTGG CCGGCACCGA CACCGACATG
AACCACGGCT TCGGCAAGCT GAGCTATGAC TTCCCCAACG GGGGGGATCT GGCTCTGAGC
GTCGACGCGC GGCACAGCAT AAACGGCACC GAGGAGTCGC GGCTGTACGA TTTTCACAAC
ACCGGCGCCG CGACGCATAC CACGGGGAAC CTGTTGCTCC ACTACCCGTT GGCGGAGCGG
CTTGCCCTGG AGGCGAGCGC GCACACGGGG AGGCGGGAGG CGCAGAACCG GCGGGCGGTC
CTCACCACGG GGGCCCTCTT CGCCGACTTC AAGGCCCATG AGGGGTTCCG GGGAGCCGAC
GCCGCGCTCA GCTGGGGAGA CGCAAGCCAT GCCCTGAAGG CGGGGGTCGC CTACGAGGAG
ACCGAGGTCG GGCTGCAAGA GCCGGTAAAT CTGCTGCCGC AGATGAACTA CGATCTCGCC
CTCAGGCGGG AATCGGCCTA CCTGAGCGGT ACCTACACGG TGGGGAGGCT CTCCATCCTG
CCGGGGGTGC GCCTGGACCG GCTCAGCCTC ATGGAAGACC CGGTCAGTTT CACTTTGGGA
GCAACGCTTC GCCTGACGGA CAACACGCTG CTGCGCGGCT ACGCGGCGTA CGGCTACGGC
ATGCCCATCG TCAGCAATAT CGGCATGCAA AACGGAGAGG TGCGGCGGGA CCTGCAGCAT
GTGCGTACGG TGCAGATGGG GCTTGAGAGC GCGGATCTAC CGTTTGTCTG GGTGAAGACG
ACCTTCTTCT ACGCCAAGGT GAACATCGAG GATTTCGACC AGCCGACGCC GCAGCTAAAC
CAGCAGGTGA AGCAGGGCGC CGAACTTGAA CTAAAAAGCA CCCCCTGGTA CGGCTTCACC
CTAAGCGGCT CCTACAACTT CACCGACGCC CGCGACCACA GGACCGGCGA AAAGCTTTCC
GCCATAGACT CGGGGCCGCG GCAGGCGCTG AAGGTCGGGG TGAGGTACGA CAGCGAACCG
GCCGGGCTCT CGGCTACGCT TCTGGGGGAC TGGGCGAAGC TGCACAACCC GGGCCGCTTC
GCTAACACAA AAGGGGAGTT CTGGGACCTC CACCTGACCC AGAAGCTCTC CCCCGACTCG
GACAGCTCCC CCGAGCTGTT CCTGTCCTGG CACAACATCT TCGACAGCGA ACGGTACTTC
TACGACTTCC GGGCCAACGC GCCGCGCTGG TTCGAGGCAG GGGCGAGGTG GTTTTTCTGA
 
Protein sequence
MRRNFINRLR FPCALLCAVL AAAPALGSGP DELSRSFGLS DPPPELARFP RPASRIAENV 
TVVTSEDIAR INAHTVAEVL HTLPGIQMDQ LQTPGSINFF TVLGASSRHV LVQIDGVAQN
FIGSENIAQV GLIPVQMVER IEVVKGAASA AWGSALGGVI NIVTKSPAAD RAGMVSGSAG
KSATRDLRAE ATGSGERFGY YLTAGKLRSD GLVAGTDTDM NHGFGKLSYD FPNGGDLALS
VDARHSINGT EESRLYDFHN TGAATHTTGN LLLHYPLAER LALEASAHTG RREAQNRRAV
LTTGALFADF KAHEGFRGAD AALSWGDASH ALKAGVAYEE TEVGLQEPVN LLPQMNYDLA
LRRESAYLSG TYTVGRLSIL PGVRLDRLSL MEDPVSFTLG ATLRLTDNTL LRGYAAYGYG
MPIVSNIGMQ NGEVRRDLQH VRTVQMGLES ADLPFVWVKT TFFYAKVNIE DFDQPTPQLN
QQVKQGAELE LKSTPWYGFT LSGSYNFTDA RDHRTGEKLS AIDSGPRQAL KVGVRYDSEP
AGLSATLLGD WAKLHNPGRF ANTKGEFWDL HLTQKLSPDS DSSPELFLSW HNIFDSERYF
YDFRANAPRW FEAGARWFF