Gene GM21_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0104 
Symbol 
ID8135407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp125642 
End bp126718 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content67% 
IMG OID644867724 
Producthypothetical protein 
Protein accessionYP_003019948 
Protein GI253698759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.00147272 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCACAA GTGACCGCGG GCATGGCTCG AAGAGCTGGG GCGTCGCCTG GGGCGTTGCG 
GCCGTCGCCG GGATCGCGCT CTGGGCGGTC CTGCTCCGGG GGGATGACCC CGCGCGGGCC
TGGCGCTCGC TCCTGGTCAA CTTCCTGTTC TTCAGCTCGC TTTCCGCCGG GCTCGTGGTC
TGGCCGGCGC TGGTAAGGAC CTGCAACGGG AAATGGCAGC TGGGGGTGGA ACGCCACGCC
AGCGCGGCAA TAGCCTTCGC CCTCCCCTCC CTCCTCGCCC TCGCCCTCCT CTGGGGGGGA
AGCGGCGCCT GGGCGCCTTG GTACCGGGCG AACTTCCACC AGGGCCTTTG GTTGAACAAC
AGCTTTCTCT TCGCGCGGGA CCTGGCGGCC CTCCTGTTGT TCTGGGGCTG GGCCGCGTTT
CACCTGGCGC GGCGGCGCCA AGGGAACGGC AGGCGCTCGG GGGTCGTCCT CCTGGTGGTC
TACGCCCTCA CCTTCTCCCT CTTGGGTTTC GACCTGGTGA TGGCGCTCGA TCCGCACTTT
CACAGCAACC TGGCCGGCGG CTACTTCTTC ATGTCCGGGC TCTACATCGG CATCAGCGGC
TGGGCCCTCA TCGCCTGCCT GAAGGGGGGG GCGAAGCCCA AGCAGTTGCA CGACCTGGGG
AAGCTCATGC TCGCCTTCAG CCTGATGACC ACCTACCTCA TGTATGCGCA TCTGCTCCCC
TTCTGGTACG AGAACCTCCC CCCGGAGATC CGTTTCCTGG TGCCGCGCAT GCACAACGAA
AACTGGTCGC CGGTGAGCGT GCTGCTGCTC TGCACCGTCT ACTTCGGTCC GCTGGTGCTG
CTCCTTCCCG CCCGCTTCAA GCAAAACCGC TATACGCTGG GCGCGGTAGC CCTCTTGGTC
GTGGCCGGGA TGTGGCTTGA GCGCTGGTGG CTGGTGGCGC CGACCTTCGA CCCGCTGGCG
AGGCTCGGCC TGAGCGAGCT ATCGCTCGCC TTAGGCTGTA CCGGGCTCCT CGGGCTGGGG
ATGCTGATCA GCCCGCGCCA CCTGCCGAGC GATGCGCCGG AGGGGGATGA GCCGTGA
 
Protein sequence
MSTSDRGHGS KSWGVAWGVA AVAGIALWAV LLRGDDPARA WRSLLVNFLF FSSLSAGLVV 
WPALVRTCNG KWQLGVERHA SAAIAFALPS LLALALLWGG SGAWAPWYRA NFHQGLWLNN
SFLFARDLAA LLLFWGWAAF HLARRRQGNG RRSGVVLLVV YALTFSLLGF DLVMALDPHF
HSNLAGGYFF MSGLYIGISG WALIACLKGG AKPKQLHDLG KLMLAFSLMT TYLMYAHLLP
FWYENLPPEI RFLVPRMHNE NWSPVSVLLL CTVYFGPLVL LLPARFKQNR YTLGAVALLV
VAGMWLERWW LVAPTFDPLA RLGLSELSLA LGCTGLLGLG MLISPRHLPS DAPEGDEP