Gene GM21_1354 details


Gene Information

Locus tag: GM21_1354
Symbol:
ID: 8136682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1596528
End bp: 1597757
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 60%
IMG OID: 644868968
Product: Sporulation domain protein
Protein accession: YP_003021171
Protein GI: 253699982
COG category:
COG ID:
TIGRFAM ID:
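
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1596528, 1597757)
print(length)                     # 1230
print(protein_length_aa(length))  # 409
```

Both results agree with the Gene Length and Protein Length fields reported in the record.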


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 5.89794e-26
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTCAATA TTCGAATGCT TGTGGCGTGG ATGTTGTTGT CCGTTTTTTG CTGTCCGCTG 
ATATCACATG CGGCGCAGCC TGATGAAGCT GCGATGATGG CTACTGCGAA GGGCCATTTT
CAAGACGGCG GCTATTACTA CGCTTCAACC TGGCTGGAGC GGATACTGAA AAAATGGCCC
AAAACCGGTC AGCGCGAGGA AGCTCTGGTG ATGCTGGCCA AGTCGTATGC CGCTACCGGG
CGGGAGGAGA AGGCGGCGCG TACGGTAAAG ACCCTGTTGA AAGAATATCC CCAGACGGCC
GCCAAACTGG ATCCGGAGAT GCTGAAGCTG GCCCAGGAGA CCTACGCTGA GGCGCCGCCG
GCTTTCCAGG CAGCCGAGGC GCCTGCACCG GCGCCTGTCG CTCAGGCTGT TTCGGAAACG
GCGAAAGTTG CAGAGGCCGC GGTTGCCCCT GCTGCCGCCG GAGTCAAGTC CGCTGTTGCG
GTCGCTGTCC CTGCTCCCGC GCAACCAGCC GCTTCCGCCC AGGCCGTACC CGAGGCCCTC
AAGGAAATCG CTTCTACCGA GTCCACCGAA CCGGCCGCGG CGAAGCTCCC TGCTGACGCC
AAGACGCCTG TCGCAGCGCA GGCGTCGGCT AAGCCCGATG TGGCCACACC TGCTGCGGCG
CAAGTCGCCA TAGCTGCTTC CGTCCCGGCC AAGCCGGTGA TTTTGCCTGT CGCTGCCGCA
AGCGCGGCTG AGACGGAAAC AGCCTGCCGC GACAACTCCG CAACCGCCAC GGGGACCTAT
GCCATAGAAC TTGGCGAGTT TATCGGGAAG AACTCGTTGG TCAGGGCGAA GAAAGCGGTC
AAGAAAGCGG GGCTTGTGCC GGTTGTCGCG CAGGGGCGCC AGAAAGTTGA AGTGATGTTG
CGGATACTGA TGGGTGAATA CCACGACGAA GGCGCAGCAA AGAAAATGCT GAATAAACTG
CGAAAGGCCG GTGCCGAGCA TTTCATGCTC AAAGACAAAG GGAGGACCTT CCGCGTTTAT
GCCGGGTCCT ACTTCGAGCA CCAGGGCGCT CTTGACGAGC AGAAGCGTCT CTTGGCCCAA
GGCCTTGATT CGGAGTTGAG GGAGGCAACC GTCACTGTCT CGACCTACCT CATCAACGCC
GGCTGTTTTC CCACGGACCA GGCCGCCAAG GGGAAGCTGG CTGAGTTGGA GCGTATGGGA
CTGAAAGGTA AGGTACTCCC TCCTCAATAG
 
Protein sequence
MLNIRMLVAW MLLSVFCCPL ISHAAQPDEA AMMATAKGHF QDGGYYYAST WLERILKKWP 
KTGQREEALV MLAKSYAATG REEKAARTVK TLLKEYPQTA AKLDPEMLKL AQETYAEAPP
AFQAAEAPAP APVAQAVSET AKVAEAAVAP AAAGVKSAVA VAVPAPAQPA ASAQAVPEAL
KEIASTESTE PAAAKLPADA KTPVAAQASA KPDVATPAAA QVAIAASVPA KPVILPVAAA
SAAETETACR DNSATATGTY AIELGEFIGK NSLVRAKKAV KKAGLVPVVA QGRQKVEVML
RILMGEYHDE GAAKKMLNKL RKAGAEHFML KDKGRTFRVY AGSYFEHQGA LDEQKRLLAQ
GLDSELREAT VTVSTYLINA GCFPTDQAAK GKLAELERMG LKGKVLPPQ
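
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to run basic sanity checks on a coding sequence. The helper names are hypothetical, the start codon set lists only the common bacterial starts (translation table 11 permits a few more), and the example uses only the first and last printed lines of the gene sequence, not the full gene.

```python
START_CODONS = {"ATG", "GTG", "TTG"}  # common bacterial starts (assumed subset)
STOP_CODONS = {"TAA", "TAG", "TGA"}   # stop codons in translation table 11


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and start/stop codons are present."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First and last lines of the gene sequence as printed above (an excerpt only).
first_line = "ATGCTCAATA TTCGAATGCT TGTGGCGTGG ATGTTGTTGT CCGTTTTTTG CTGTCCGCTG"
last_line = "CTGAAAGGTA AGGTACTCCC TCCTCAATAG"
excerpt = join_blocks(first_line + "\n" + last_line)

print(len(excerpt))             # 90
print(looks_like_cds(excerpt))  # True
```

Passing the full gene sequence through `join_blocks` and then `gc_fraction` would allow a comparison with the GC content field in the record.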