Gene GM21_0488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0488 
Symbol 
ID8135797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp600822 
End bp601958 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content65% 
IMG OID644868106 
Productprotein of unknown function DUF362 
Protein accessionYP_003020326 
Protein GI253699137 
COG category[S] Function unknown 
COG ID[COG2006] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00000000573223 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTACCAGG TCGCGGTAGA GAAGGTCGGG GATTACCGGC GGGAGCCGGT CCAGGAAGGG 
GTGGCGCGGC TTCTGGCCAG GCTCGGGGGG ATGGAGCGCT TCGTGAAGCC CGGCGAGCGG
GTGCTGATCA AGCCGAACCT CCTTTCCGCG AAGCCCCCCG AGGCCGCCGT CACCACGCAC
CCGGAGCTGT TGCGGGCCGT GATCCTGCAG GTGCAGCAGG CGGGGGGGAT AGCGCTCGTC
GGGGACTCCC CGGGGATAGG GAGTGCCGTA AAGGTCGCTA GACGCTCGGG GATGCTGGCG
GTGATCGAGG AAACCGGGGC CGAATTCGTC CCCTTCGTGG AGAGCCGCGA GGTCGCAGGC
TCCGGGGTTT ACCGCCGTTT CGAGCTGGCA GCCCCCTACC TGGAGGCGGA GCGCCTGATC
AACCTCCCCA AGCTGAAGAC CCACGAGATG ATGACCATGA CCTGCTGCGT CAAGAACCTC
TTCGGCGCCA TAGTGGGGAC GCAGAAGGCG GCCTGGCACC TGAAGGCCGG GGCGGACAAG
GATCTATTCG CCCGGATGCT GTTGGAGGTG TACCGGTTGC GCGAGCCGGA TTTGAATATC
GTGGACGCCA TCGTAGGGAT GGAGGGAAAC GGCCCGGGAA GCGGCGACCC CTGCCAGGTA
GGTCTCCTTT TGGCAGGGGA TAACGCCCTC GCGGTGGACC AGGTGGCCGC GGAGATCGCC
GGCATCCCCA AAAAGCTCCT CTACGTGGAA AACGCCGCGC GTCGGATGAA GCTTCCCGGA
GCCGAGCGTG CGGAGGTCGA GTACCTGGGG CTTAATTCTA ATGAAGTCCC TTTCCGGAGC
TTCCGGCTCC CCCATCTGTC AGACGTCCAG TTCGGACTCC CCGGCTTCTT GAAGCATCGG
CTGCGAAACC AGTTGACCTC CCGCCCCGAG GTGGTGGACG GTGCGTGCCG GCTCTGCGAA
ATCTGCGTCA GGGCTTGTCC TCCGGGCGCG ATCTGGGTGG AGGGGGGGAG GCTGCGCTTC
GATTACCGGC GCTGCATCCG CTGTTTTTGC TGCCGCGAAC TCTGTCCGCA CGCGGCGCTT
AGGCTCAAGG ACGGCTGGCT TCTTTCGCTA ATAAAAAAAA GTGGCACACC CCTTTAA
 
Protein sequence
MYQVAVEKVG DYRREPVQEG VARLLARLGG MERFVKPGER VLIKPNLLSA KPPEAAVTTH 
PELLRAVILQ VQQAGGIALV GDSPGIGSAV KVARRSGMLA VIEETGAEFV PFVESREVAG
SGVYRRFELA APYLEAERLI NLPKLKTHEM MTMTCCVKNL FGAIVGTQKA AWHLKAGADK
DLFARMLLEV YRLREPDLNI VDAIVGMEGN GPGSGDPCQV GLLLAGDNAL AVDQVAAEIA
GIPKKLLYVE NAARRMKLPG AERAEVEYLG LNSNEVPFRS FRLPHLSDVQ FGLPGFLKHR
LRNQLTSRPE VVDGACRLCE ICVRACPPGA IWVEGGRLRF DYRRCIRCFC CRELCPHAAL
RLKDGWLLSL IKKSGTPL