Gene GM21_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1035 
Symbol 
ID8136357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1216481 
End bp1218466 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content65% 
IMG OID644868646 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003020854 
Protein GI253699665 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones89 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCGA CGCGGGAAAT TTACTGGAAC GTCAATCACG CTCTGATCTG GGTGATGTAC 
CTCTTCGCCT TTCTGGCCCT GGCGGCTTGC GCCTGGGGCT TTTGGCGGCG TCTCCCCATG
TACCGGCAGG GAAAGCAGCC GCTTAACCGG CTGGACCGGC TCCCCGAGCG CGTCCGGCAC
TTTCTCAAGG GGATGTTTTC GCAGGTGAAG GTGCTGCGGG TGCCCGAGCC GGGGACGCTG
CATGCATTTT TCTACTGGGG GTTCCTGCTC CTTTTCATCG GGACCCTCTT GATCATGCTG
CAGGCCGACT TCACCGAGCC CTTGTTCGAC ACCGTGTTTT TGCAGGGGAA TTTCTACCGC
GGCTATTCCC TGGTGCTGGA TCTGGCGGGG CTAGCGGCGA TCGTCATGCT GGGGGGGCTT
TTGGTGCGCC GCTGGTTCGT GAAGCCGAAA GGGCTCCCGA CCGGCGGGGA CGATTACCTG
GCGCACGCCC TTCTCTTCGC CATCCTCGTG ACGGGGTTCG TCGTGGAGGG GCTCCGCATG
GCCTCGACCG AGATCGGGAT CAACCCGGAA CTGGCGCGCT GGTCGCCGGT AGGGGGGCTC
TTCGCCCGTC CCTTCGTCGG GATGGATCTT GGGCGGCTTT CCCTGATCCA CAAGACTCTT
TGGTGGGGGC ACCTGTTCCT GGCGCTCTTC TTCATCGTTG CCATTCCCTT CACCAAGCTG
CGGCATCTGT TCACCACGCC GGTCAACTAC CTCTTTACCG ACTTAAGGCC CAAGGGGGCG
ATCGCGACCA TCGACCTGGA GGACGAGGGG GCGGAGCAGT TCGGCGTCGC CAAGGTGACG
GATTTTTCCT GGAAGGACCT CTACGACCCC GATGCCTGCA CGGTCTGCAA GCGCTGCCAG
GACCGCTGCC CGGCCTGGAA CACGGAAAAG CCGCTTTCCC CGATGCATGT GGTGCTGCAG
ATAGGGGAGG TGGCGGCGGC GACGCCGCAG GCGGATCTCT GCCGGACCGT CACCGAGGAG
GTCCTTTGGG ACTGCACCAC CTGCCGGGCC TGCCAGGAGA TCTGCCCGGC CGAGATCGAG
CATGTGAACA AGATACTCGA GATGCGCAGG AACCTGGCGC TCATGGAAGG CTCCTTCCCC
GGCGAGGAGG TGCGCGTGGC CATGGCCAAC TACGAGGTGA ACGGCAACCC CTTCGGCATG
GCCTACGCCG AGCGCGGCGC CTGGGCCGAA GGCCTCGACG TCGCCGTCAT GGAGAGCGGC
GCCGCGGTCG ACGTCCTCTA CTTCGTCGGC TGCTACGCCT CCTTCGACCG CAGGAACCAG
GAGGTGGCCC GCGCCTTCGT GAAGCTCTGC AACGCCGCCG GCGTCAGGGT CGGCATCCTC
GGCAAGGAGG AGAAGTGCTG CGGCGAGCCC CCCAGGAAGC TCGGGAACGA GTACCTGTAC
CAGGGGATGG CGCAGGAGAA CATCGAGAAG ATCAAAGGGT ACGGGGTGCC GCGGGTGGTG
ACCACCTGCC CGCACTGCTT CAACACCCTG GCCAGGGATT ACCGCGATCT GGGCTTCGAC
ATCCCGGTCG AGCATTACAC CACCTTCCTC CATGACCTGG TGCAGCAGGG GAGGCTGAAG
CTGAAAGCGG AGCCGTTTGC CTGCACCTAT CACGATTCCT GCTACATAGG GCGCTACATG
GACATCTTCG AGGAGCCGCG CGAGCTTTTG GATCGCGCCG GCGCTAGCAT CGCCGAAATG
GGAGCGAGCC GCCTGGAGAG CTTTTGCTGC GGCGCCGGCG GGGGGCGCAT CCTGGCGGAG
GAGAAGCGCG GCACGCGGAT CAACGTGGCG CGGGTGCGGA TGGCGCAGGA AACCGCCGCT
CCCATGCTGG TTTCCAACTG CCCGTTCTGT CTCACCATGT TCGAGGACGG CATCAAGACC
GGAGGCGCTG AGGGGACGGT CGCCGCAAGG GATCTGGCGG AGATTCTCGC GGAGCGGATC
GCCTGA
 
Protein sequence
MEATREIYWN VNHALIWVMY LFAFLALAAC AWGFWRRLPM YRQGKQPLNR LDRLPERVRH 
FLKGMFSQVK VLRVPEPGTL HAFFYWGFLL LFIGTLLIML QADFTEPLFD TVFLQGNFYR
GYSLVLDLAG LAAIVMLGGL LVRRWFVKPK GLPTGGDDYL AHALLFAILV TGFVVEGLRM
ASTEIGINPE LARWSPVGGL FARPFVGMDL GRLSLIHKTL WWGHLFLALF FIVAIPFTKL
RHLFTTPVNY LFTDLRPKGA IATIDLEDEG AEQFGVAKVT DFSWKDLYDP DACTVCKRCQ
DRCPAWNTEK PLSPMHVVLQ IGEVAAATPQ ADLCRTVTEE VLWDCTTCRA CQEICPAEIE
HVNKILEMRR NLALMEGSFP GEEVRVAMAN YEVNGNPFGM AYAERGAWAE GLDVAVMESG
AAVDVLYFVG CYASFDRRNQ EVARAFVKLC NAAGVRVGIL GKEEKCCGEP PRKLGNEYLY
QGMAQENIEK IKGYGVPRVV TTCPHCFNTL ARDYRDLGFD IPVEHYTTFL HDLVQQGRLK
LKAEPFACTY HDSCYIGRYM DIFEEPRELL DRAGASIAEM GASRLESFCC GAGGGRILAE
EKRGTRINVA RVRMAQETAA PMLVSNCPFC LTMFEDGIKT GGAEGTVAAR DLAEILAERI
A