Gene GM21_3159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3159 
Symbol 
ID8138511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3669133 
End bp3670251 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content59% 
IMG OID644870764 
ProductNHL repeat containing protein 
Protein accessionYP_003022944 
Protein GI253701755 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02145] Fibrobacter succinogenes major paralogous domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones119 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTAA GCGTCTTTCG TTGGCTATCG CGGCACAAAC CGATGAGGGG GGTGGTCGCC 
GCCCTGGCGC TGCCGTTGTG TCTGCTCTGC GCTTCGTGCG CGACTCTCGA GCCGACGCAG
CCGATAGCGG ACCGGCAGGA ACTCCAGTGG CCCCCCTTGC CGCTCATGCC ACGCATACAA
TGGGTCAAGG AGATCCGGGA TCCACATGGA GCCGGGATTG AAAAGGGGAT GTGGCGCCGC
TTCACCGAAG TCTTCACCGG CGCAGTGGAA AACAGGATAG GCAAGCCGTA CGGCGTCTAC
TTCGACGAAC GCGGCAGTCT CTTCGTGGTG GATGTATCCT ATGGCGTCGT CCACGTAATG
GACACCAGGG AGAAAAGCTA CTACATCATC GGCGACGCCG AAAAACGCAC CTTCCGCTCG
CCCATCGGCA TTACCGAGGA CGATCGGGAC AACCTCTACA TCACCGATTC CGGCGCCGCC
GGCATCTTTC GCTACAACCT GACGAAAAAG AAACTGGAGC CGTTCATCAT TTCGGACCTG
GCTCGCCCCA CCGGCATCGT CTTCAATAAA AGCAACCGCC TTCTCTACAT CACCGACACC
ACCGAACATC AGGTGGTTGT CTTCGATCTG AAGGGCAACC TCCGTTACCG CATCGGCAGC
AGGGGAAGTG CCGCTGGGCA TTTCAACTAT CCGACCGATA TAAATGTGGA CAACAGCGGG
CGACTGTACG TGACCGATGC GCTCAACTCG CGCATTTCCA TCTTCTCCGC CGAGGGGACC
CACCTTAACA GCTTCGGCAG GTCAGGGGAC ACGGCTGGCA ATCTGCCGAA AGCAAAAGGG
GTCGCCGTCG ACAGCGCAGG CAATATCTAC ATAGTCGACG CCCTGCTGGA CGCGGTGCAG
ATATTCGACC AAAGCGGCGT GCTGCTGCTG ACCTTCGGCA GCAACGGCAC CAATGCAGGG
GAGTTCTGGA TGCCTTCGGG CATCTACATC GATCGCAACG ACTACATCTA CGTTTCCGAC
TCGTACAACC GCAGGATCCA GGTCTTCAAG TACCTCGATG TCAAAGACGC ATCGGCGCGG
GCCGTCGAAC CAGCCCGCAA CAAGCAGACG GAGCAGTAA
 
Protein sequence
MAVSVFRWLS RHKPMRGVVA ALALPLCLLC ASCATLEPTQ PIADRQELQW PPLPLMPRIQ 
WVKEIRDPHG AGIEKGMWRR FTEVFTGAVE NRIGKPYGVY FDERGSLFVV DVSYGVVHVM
DTREKSYYII GDAEKRTFRS PIGITEDDRD NLYITDSGAA GIFRYNLTKK KLEPFIISDL
ARPTGIVFNK SNRLLYITDT TEHQVVVFDL KGNLRYRIGS RGSAAGHFNY PTDINVDNSG
RLYVTDALNS RISIFSAEGT HLNSFGRSGD TAGNLPKAKG VAVDSAGNIY IVDALLDAVQ
IFDQSGVLLL TFGSNGTNAG EFWMPSGIYI DRNDYIYVSD SYNRRIQVFK YLDVKDASAR
AVEPARNKQT EQ