Gene GM21_0967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0967 
Symbol 
ID8136288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1144013 
End bp1145332 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content61% 
IMG OID644868581 
Productprotein of unknown function DUF21 
Protein accessionYP_003020790 
Protein GI253699601 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value0.854838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGACACAA TCTTCGTCGA ACTGCTTGTC ATCGCCATCC TCATACTGCT GAACGGATTT 
TTCTCCTGCG CCGAATTCGC CATCATCTCC ATCAGAAAGA GCCGCGTTGC TCAACTGGTC
GCCCTGGGCG ACCAGCGTGC CGCGCTCGTG GAATCCCTGC AGAAGGATCC GCACCGCCTG
CTGGCCATCG TGCAGATCGG CGTCACCGTG GTCGGATCGA CCGCCTCCGC GGTGGGGGGC
GTGATAGCGG TCGACTACAT CCGCCCCATC CTGCAGCTAT CCCCCTTCGC CATGATACGC
AACGCCGCCG AACCGCTCTC CCTCACCATG GTGGTAGCCG TGATCTCGTA TCTCCTGCTC
ATCCTGGGAG AACTGGTTCC GAAGACCATC GGCCTGCAGT ACGCCGACCC CGTGGCGCTT
CGCATCGCCA AGACCATCAC CTTCCTGGCG AGGATCGCCA GTGTCCTGGT ATCGTTGCTC
AGCTACTCCA CCAGGGGGGC GCTGGCGCTG TTCCGGATCA AGGGTGAGGG AAAGGCCTTC
ATGACGCGGG AGGAGGTGCA GCATATCGTC GCCGAGGGGC ATGAGAGCGG CATCTTCAGC
GAGGCCGAGC ACACCTTCAT CGACAACCTC TTCGACTTCA CCCATACCGC CGTAAGGGAG
GTGATGGTCC CCCGCACCAG GGTGGTCGCC TTCGACCTCA ACCTTTCCAA CGAGGAGATC
CTGAACCAGG TCCTGGACAA CATGTACTCC CGTTACCCGG TGTACGTGGG GAGCATCGAG
GAGACGGTCG GTTTCATCCA CGGCAAGGAC CTCTTAGGGA GGATGGTGCG CGAGCCGGAT
TTCGATATCC GCTCCATCGT CCGTCCCCCC TTCTTCGTTC CGGAGGGGAA GAAGGTGAGC
GAACTCTTGA AGGAGATGCA GAAGACCCGC GTGCACATGG CTTTCGTGGT GGATGAGTAC
GGCAGCATCA GCGGCATAGT GACCACCGAG GACCTGCTCG AGGAGCTGGT CGGCGAGATC
GAGGACGAGC ACGATGTCGG CGAGCCGAGC ACGGTGCAGA TCCTGGCCGA CGGGAGCTAC
CTGGTGGATG CCTTCATCTC CGTTTCCGAT CTGGAGGACC TGCTGGAGAT GGATCTTGGC
GAGGATCTTC CCTTCGACAC CCTGGCCGGG CTGATACTGG ACCGCATCGG CGGGTTTCCG
GAGCAGGGCG AGAAGCTTCA GTTGGGCGAG TACACCCTCA TCTGCGAGGA AGTCACCCGC
ACCGGCATCA CCAAGGTGAG AATCGGGAAA ACAGAGGGGA AATCTGGGGC TGGGGACTAG
 
Protein sequence
MDTIFVELLV IAILILLNGF FSCAEFAIIS IRKSRVAQLV ALGDQRAALV ESLQKDPHRL 
LAIVQIGVTV VGSTASAVGG VIAVDYIRPI LQLSPFAMIR NAAEPLSLTM VVAVISYLLL
ILGELVPKTI GLQYADPVAL RIAKTITFLA RIASVLVSLL SYSTRGALAL FRIKGEGKAF
MTREEVQHIV AEGHESGIFS EAEHTFIDNL FDFTHTAVRE VMVPRTRVVA FDLNLSNEEI
LNQVLDNMYS RYPVYVGSIE ETVGFIHGKD LLGRMVREPD FDIRSIVRPP FFVPEGKKVS
ELLKEMQKTR VHMAFVVDEY GSISGIVTTE DLLEELVGEI EDEHDVGEPS TVQILADGSY
LVDAFISVSD LEDLLEMDLG EDLPFDTLAG LILDRIGGFP EQGEKLQLGE YTLICEEVTR
TGITKVRIGK TEGKSGAGD