Gene GM21_0320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0320 
Symbol 
ID8135627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp394729 
End bp396051 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content63% 
IMG OID644867937 
Productprotein of unknown function DUF195 
Protein accessionYP_003020159 
Protein GI253698970 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones118 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCCTG ACCTTTTTCA ATTGATCATC CTGCTATCTT GCACCGCCAC CTTCGTGGTC 
GCGTTCCTTT GCTACCTGCA TCTCAAACGG GCGCACATTG CCGAAGCGCG CTTCGACCAA
CTGGAAAAGG GGCTGGAGCG ATTGGAGCGC ACCCTCCAGA CGGAACTGGG GCGAAACCGC
GAGGAGCTGG GGGGGAACCT CAGGCAGTTC GGGGAGGCGG TGCAAAAGCG GATGGTGGAT
ATCGCCTCGC TGCAAAAGGG ACAGCTGGAA GGGTTCACAC AGCAGCTCGG CAGCCTCACC
GCGAGCAACG AGCAGCGCCT GGATAAGCTG CGCGAAACGG TAGAGCTGCG CCTCAAATGG
CTGCAGGACG ACAACTCGAA AAAGCTGGAG CAGATGCGCG CCACGGTCGA CGAAAAGCTG
CACGAGACCC TGGAGAAGCG GCTGGGCGAG TCGTTCAAGC AGGTAAGCGG CCAACTGGAG
CAGGTCCACA AGGGGCTGGG GGAGATGCAG TCGCTTGCCG CAGGCGTCGG CGATCTGAAA
AAGGTCCTCT CCAACATCAA GACCCGCGGC ACGCTCGGCG AGGTGCAGCT GCACAACCTT
TTGGAGCAGA TACTCACCCC GGACCAGTAC GGCGCCAACG TCGCCACGAA GCCTGGGAGC
GACGCGCGGG TCGAATTCGC CATTCGCCTC CCCGGCAAGG ACGACAAGCC GCTCTGGCTC
CCGGTCGACG CCAAGTTCCC GCAGGAGGAC TACCTGCGGC TGGTCGAGGC CCAGGAGCAG
GGGAACCAGG TCGCGATACA GGAGGCGACC AGGCAGTTCG ACAAGACCGT CGCCGCCATG
GCCAAGCTCA TCTGCGAGAA GTACTTGGCC CCTCCCGACA CCACCGATTT CGCCGTGATG
TTCCTGGCCA ACGAGGCAAG CTACGCGCAG GTGTTAAGCC GTCCCGGGCT CTTCGACGCC
ATCTTGCGCG AGCACAAGGT CATCGTCGCA GGCCCCACCA CCATCGCCGC GCTGCTTTCG
TCGCTCAGCC TCGGCTTCAA GACGCTCACC ATCGAGAAGC GCAGCAGCGA CGTCTGGCGG
CTATTGGGCG CGATAAAGAC CGAGTTCATG ACCTTTGGGA CCCTGCTGGA AAAAACCAGG
AAAAAGTTGG ACGAGGCCTC CTCCAGCATC GACACCGCCG CCACCCGCAC CCGCCGTATT
CAGCGCAAGA TGCAGGGGAT CGAACAGTTG CCGGAGCACG AGGCGAAGGG ATTGCTGGGA
GGGGAGCTTG GCGCGGCCCC CGAGCCGGAG AGCGGCGAGG TGATCCTGAT CGACGAGGCG
TGA
 
Protein sequence
MRPDLFQLII LLSCTATFVV AFLCYLHLKR AHIAEARFDQ LEKGLERLER TLQTELGRNR 
EELGGNLRQF GEAVQKRMVD IASLQKGQLE GFTQQLGSLT ASNEQRLDKL RETVELRLKW
LQDDNSKKLE QMRATVDEKL HETLEKRLGE SFKQVSGQLE QVHKGLGEMQ SLAAGVGDLK
KVLSNIKTRG TLGEVQLHNL LEQILTPDQY GANVATKPGS DARVEFAIRL PGKDDKPLWL
PVDAKFPQED YLRLVEAQEQ GNQVAIQEAT RQFDKTVAAM AKLICEKYLA PPDTTDFAVM
FLANEASYAQ VLSRPGLFDA ILREHKVIVA GPTTIAALLS SLSLGFKTLT IEKRSSDVWR
LLGAIKTEFM TFGTLLEKTR KKLDEASSSI DTAATRTRRI QRKMQGIEQL PEHEAKGLLG
GELGAAPEPE SGEVILIDEA