Gene GM21_3652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3652 
Symbol 
ID8139026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4229877 
End bp4231046 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content64% 
IMG OID644871273 
ProductBaseplate J family protein 
Protein accessionYP_003023431 
Protein GI253702242 
COG category[S] Function unknown 
COG ID[COG3299] Uncharacterized homolog of phage Mu protein gp47 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones174 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATCC AAGATCTGGT CTCTAAATCG CTCGATACCA TCCGCCAAGA AATGTTTGAC 
CGCATTGCGG CGGTCCAGGA TGAATATTCC GCCAAGGGGT GGCTCCCCAT CCGGTTGAAC
CTGAACAAGG GGATCGTGCG CGGCATGATC GAGCTCTGGT GCTGGGGTCT ATGGCAGCTG
TACCAGTTCC TCGCCCTGGT GCTGAAACAA GCCTTTCCGG ACACTGCCAC CGGGGTCTGG
CTAGACCTGC ATTGCAAACA AGTCGGCGTT GCTCGGAGAG AGGCCACCAA GGCGGTCGGC
GTCGTCTATT TCACACGTGC CGGCATGGTC GGAAACGTCC CGATTCCCGC CGGGCGCGTG
GTCCGCACCA AGCCGGACGG CAACGGCCTT ATATATAGAT ACGTGACCAC GGCAGCGGCG
GTGCTTTTGA ACGGGGCGAC CGAGGTGGCC GTGGCGGTCG AGGCGGAAGA ATACGGCGCG
GCCGCCAACG CGACGGTCGG GCAGATCTCC GAGATCGTGA CGGTGATCCC CGGCGTCGAC
GCGGTGGAGA ATAGGGCCGA CTGGATCACC AGAGAGGGGA GCGACCAAGA GAAGGACGAG
AGCCTCCGCG AGCGCTACCA GCTGGCCTGG AAGGTGCTGA ACGGCTGCAC CAAGTACGCC
TATGAGGCAT GGGCCAAAGA AGTGGTTGGC GTAGTCGCGG TCAAGATCAG GGACCAGCAC
CCCCGGGGCG AGGGGACGGT CGACGTGGTC ATAGTGGGGA GCGCCGGCGC GCCGACTCCG
GCATTGCTTG CCTCAGTCGA TGCCAACATC AACGGCACGG GGAACGACGA CGAGAAGAAC
CCGATCAACG ACGATGTGCT GGTAGCCGGC GCTGACCTAG TGGCCACCAG CCTCGTCGCG
CAGCTGGAGC TCAGCTACGG CGACCCGGCT GCGCTTCTGC TTGAGGCGGA AAACCGGGTG
CGGGCGCTTT ACTCCACAGT GGCCTCTGTT GCCGGGGTCG TGCCCTTCGG GATCGGCGGG
GACGTGACGC GCGACCGGCT GGTGTGGGCG ATGATGCTGC CCGGCGTAAA GCGGGTCAAC
ATGGTGTTCG CGGACGTGGC GGTACCGGAG TACGGGCTTG CCACATTGAC CGATCTCACC
CTCACCTACG TGCTGGCCGC AGAGGCATAA
 
Protein sequence
MSIQDLVSKS LDTIRQEMFD RIAAVQDEYS AKGWLPIRLN LNKGIVRGMI ELWCWGLWQL 
YQFLALVLKQ AFPDTATGVW LDLHCKQVGV ARREATKAVG VVYFTRAGMV GNVPIPAGRV
VRTKPDGNGL IYRYVTTAAA VLLNGATEVA VAVEAEEYGA AANATVGQIS EIVTVIPGVD
AVENRADWIT REGSDQEKDE SLRERYQLAW KVLNGCTKYA YEAWAKEVVG VVAVKIRDQH
PRGEGTVDVV IVGSAGAPTP ALLASVDANI NGTGNDDEKN PINDDVLVAG ADLVATSLVA
QLELSYGDPA ALLLEAENRV RALYSTVASV AGVVPFGIGG DVTRDRLVWA MMLPGVKRVN
MVFADVAVPE YGLATLTDLT LTYVLAAEA