Gene GM21_2489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2489 
Symbol 
ID8137830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2912216 
End bp2913466 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content59% 
IMG OID644870098 
Producthypothetical protein 
Protein accessionYP_003022289 
Protein GI253701100 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones111 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGGG GGATGGCACT GCTGACTTTT CTAGCCCTGG CTCCTGGTTT GGCGGCTTAC 
GCCCGTGCGG CGGATACCTC TTCGAGTTAC CGCACTCCGC TCGCAGGTGA GCCAGGCGAA
GTTATGTTCA TGGGTGAGAA GGTTACTATC CCTCCCCTTG ATCGAAGCGA TATGACCTCC
ATTACCCTCG GTGCATCTTT GCTTACCCCG CAACAGGGCG GGACCACTGC GCTCCCTGTG
GCTGTTTTCT ACCATCGGCG TATCAAGGAC GACTACCGTG CCCGCTACAC GGTCAGCCTC
TTCGTGAATG AACTGGAGTA CGACCGGAAC CTTGGGGGTG TTGAGCTGGT GACTCACTTC
GAGAACTACA CCCTGCCGGT GCAGCAAAGC GAGATACTTG AAGGCCAGGA TATGAAGGGG
ACCTCCCTTT ACTGGGGTAC CCTGCTCGGG TCGGTGGGGG CGGGATGGCG TATCCCGGTG
CGCCCCCTGG AGGTGGACAA CGACCTGCGG TTGCAACTAT TGGGACGGGT TGGGTATTTC
TATGCCAAAA CCGGCAAGGA TACCGCTGCC GACCTCTCGG TCCCGAACGA CACCATGCTC
TATGGCGCCA GGGCTCGGGT GCACTATGAC ACCATGCGTC GCAACCTGCT GGAACTGCCG
CACCAAGGGT TTGCCGTGGG TGGAGATCTG GATTTGATGC ATCGAGACAA GTGGAGTGAG
CAATCTGCTA CGGCAACGCT GGGTGGAAAC CGGGATTACC TGCAGCTGAC CGGCTATCTT
GCCGGGGCTG CGGGCATACC GGGGAGGTCG GAGCGGGACC GCCTGATTTA CTGTGCCTAT
GCCGGTCATA CCTTCGACAA CAACGGAGAC AGGTTCAACG CCTTCCGCCT CAATGGAGCC
TCTTTCCCGA GCGAGGCCGA CGATGTGGTC CGTCCTCACT ACACCGGCGT CATTTACGAC
AACATCCCGG TAACCTCATA TGCAACGGTC TCAGCGGGTT ATCGCCGCGA GCTTACCTTC
TTTCTTTACC TTAGCCTGTA CGGATCCTAC ATCTGGGCCG ACCGGGCCAC CGTGGAAGGG
ACGAATCGGG TGGCCTTCCG GGACAAAGAA GGAGGCGCAG GGACCATCAC CCTAGATAGC
GCCTTCTTGT GGGACTCATC CTTTTATCTG GCCTATACGT GGGAATCGGG TTTGATTCGA
AACGGCCGGT CCGGGGGGGG GTACACCGTG ATGTGGAACA AGCTCTTCTA A
 
Protein sequence
MVRGMALLTF LALAPGLAAY ARAADTSSSY RTPLAGEPGE VMFMGEKVTI PPLDRSDMTS 
ITLGASLLTP QQGGTTALPV AVFYHRRIKD DYRARYTVSL FVNELEYDRN LGGVELVTHF
ENYTLPVQQS EILEGQDMKG TSLYWGTLLG SVGAGWRIPV RPLEVDNDLR LQLLGRVGYF
YAKTGKDTAA DLSVPNDTML YGARARVHYD TMRRNLLELP HQGFAVGGDL DLMHRDKWSE
QSATATLGGN RDYLQLTGYL AGAAGIPGRS ERDRLIYCAY AGHTFDNNGD RFNAFRLNGA
SFPSEADDVV RPHYTGVIYD NIPVTSYATV SAGYRRELTF FLYLSLYGSY IWADRATVEG
TNRVAFRDKE GGAGTITLDS AFLWDSSFYL AYTWESGLIR NGRSGGGYTV MWNKLF