Gene GM21_1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1408 
Symbol 
ID8136736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1655234 
End bp1656517 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content62% 
IMG OID644869022 
Producthypothetical protein 
Protein accessionYP_003021225 
Protein GI253700036 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.00000000450965 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAGGT ACAATGCGAT GCGGGAAATT CCCGCTACTG CCGGATTCAA GGAAGGGGAC 
GTATTTTTTC TCTGCGGCGA GCTCTTCGGC CGCGGCTATG CCAACGGCAT CGTCGACGAA
GCAAGGGCCA AGGGGATGAC CATCATCGGG GCCACCGTCG GCAGGCGCGA CAACGACGGG
ACCCTGCGCC CGCTCAACGC CGAGGAACTG GCAACGGCGG AAGAGAACCT GGGCGGCAAG
ATCATCAATA TCCCGCTGGA AGCGGGCTTC GACATGGAGC CGGGGAGCGA CGGGATCGCA
CCGGTCGACA GGTTCAAAGG TGTCAAACCG GACGACTGGG CCTCGGTGAA ACTGGACCAG
GCCGAGATTG AATTCTCCAA AAAGCGCGGC ACCGAGCGTT TCTGCAAGAA CCTCGCGGCT
GTGGTGGCCG AAGTGGAGAA GATGCTCCCG GCCAAGGGGA GGCTCCTGGT GGTGCACACC
ATGGCAGGCG GCATCCCGAG GGCGCGCGTG TTCATGCCGA TCCTCAACAA GCTCTTCAAG
GGACAGGGAG ACCGCTTCCT CTCCTCCGAA GCCTTCTGGA ACTCCGACAT GGGGCGCCTT
TGCGACGCGA GCTTCAACGA AGTGACCGCC GACACCTTCC GCTACCTGAT CGACGCCACC
GCTGGCCTCA GGGAGAAGCG CGAGGTAACC TACGCGGCCT ACGGCTACCA CGGCACCGGC
GTACTCATCG ACGGCGTCGT CACCTGGCAG TCCTACACCC CGTACCTGCA GGGGTGGGCG
AAGATCCGCC TGGAAGACAT CGCCATCGAG GCATGGGAAA AAGGGATCAA GGCAACCGTC
TACAACTGCC CGGAGATCCT CACCAACTCC AGCGCCCTCT TCCTCGGGGT CGAGAATTCC
CTCTATCCGC TGATGGCCTC GCTCAGGGCC GAAGGGGAGC AGAAGATCGT CAAGGAGTGC
GAGGCGCTCT TGAAGGAAGG GGCCACCGTC GACACGCTGC TCGACATCGC CAACACCTAC
CTCACCTCGG ATCTGGTCAC CAGCACCCGG GACTTCGACA GCTGGCCGCA GCACAACCAG
CCGCAACAGC AGGAGTACAT GCTGAACGTG TCGGCGGAGC TGATCAGCCT GAACGCGGAC
CCGAAGGAGA TCGTCTGCGC CGTCCTCTCC AAGGGGGTGT TCCAAGGGGT AGGGAAGCTG
ATGTTCGACA GTTCCTGGGA GCCGAAGGCC CCGGTCTTCT GGTTGAACCA CGACGTGATC
GCGAAGACGC TGGTGAAGAT GTAA
 
Protein sequence
MSRYNAMREI PATAGFKEGD VFFLCGELFG RGYANGIVDE ARAKGMTIIG ATVGRRDNDG 
TLRPLNAEEL ATAEENLGGK IINIPLEAGF DMEPGSDGIA PVDRFKGVKP DDWASVKLDQ
AEIEFSKKRG TERFCKNLAA VVAEVEKMLP AKGRLLVVHT MAGGIPRARV FMPILNKLFK
GQGDRFLSSE AFWNSDMGRL CDASFNEVTA DTFRYLIDAT AGLREKREVT YAAYGYHGTG
VLIDGVVTWQ SYTPYLQGWA KIRLEDIAIE AWEKGIKATV YNCPEILTNS SALFLGVENS
LYPLMASLRA EGEQKIVKEC EALLKEGATV DTLLDIANTY LTSDLVTSTR DFDSWPQHNQ
PQQQEYMLNV SAELISLNAD PKEIVCAVLS KGVFQGVGKL MFDSSWEPKA PVFWLNHDVI
AKTLVKM