Gene GM21_3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3271 
Symbol 
ID8138628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3802343 
End bp3803803 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content61% 
IMG OID644870880 
Productprotein of unknown function UPF0027 
Protein accessionYP_003023055 
Protein GI253701866 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones119 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTAC CGGCGGCGCT AAAGCGGATC ACAGATCAGT TGTGGGAACT ACCGGTAAGC 
TACAAGGAAG GGATGCTGGT CCCCGCCCGA ATCTTTGCCT CAGAGAAATT GGTCCGGGAG
ATGGATGCCG GCGTTTTCGA GCAGGTCAGC AACGTCGCCA CGCTCCCCGG CATCCAGAGA
TACGCCTACT GCATGCCCGA CGGCCACTGG GGCTACGGCT TTCCCATAGG GGGTGTAGCC
GCCATGGATC CGGGTACCGG CGTCATCTCG CCGGGAGGGA TCGGTTTCGA CATAAACTGC
GGCATGCGGC TCGTATTGAC GAACCTCACC GCCGACCAGG TCATCCCCAA ACTGCATCAA
CTGGTCGATC GTCTCTTCGC CCGGATACCC ACCGGCGTCG GATGTCATGG GTTCGTGAAG
CTGAAGCAGG ACGATTTTCG TTCCATAGTG CAGCAGGGTT CGCGCTGGTG CCTGAAAAAC
GGCTTCGCTA CCCAGGAAGA TCTGGATATG ACCGAGGAAG GGGGCTGCTT TTCCGGCGCC
GACGCCTCAC ACATAAGCGA CAAAGCGGTG GAACGCGGCT ACAACCAGCT CGGCACACTG
GGGTCCGGCA ACCACTACTG CGAGATCCAG GTGGTGAAGC CTGAGAACGT CATGGACGCG
GAATTGGCCG CAGCCTTCGG GCTTACCATG GTACCCAACC AGGTGGTGAT CATGTTCCAT
TGCGGCAGCA GGGGCTTCGG GCACCAGGTG GCGACGGACT ACCTGAAGCT GTTCCTCTCC
GTCATGGGGC GCAAGTACGG CATAAAGATC GTCGACCGCG AACTTGCCTG CGCTCCTTTT
CACTCGCCCG AAGGTCAGGC CTACTTCAGC GCGATGAAGT GTGCCGTCAA CATGGCCTTT
GCCAACAGGC AGGTGATCCT GCACCGGATC AGGGAGGTGT TTTCCGACCT GTTCCACGCC
TCGCCCGACG AACTCGGGCT GCGCATGGTG TACGACGTGG CGCACAACAC GGCAAAGCTG
GAACGGCACG AGGTAAACGG GACCCGGAAG GAACTCCTGG TGCACCGCAA AGGATCCACC
CGCGCCTTCG GCCCTGGCGC TGCAGGGCTA CCCGGATGTT ACGCGAAGAC CGGCCAGCCT
GTCATCATAG GCGGGAGCAT GGAGACCGGC TCCTATCTGC TCGCGGGGAT GCAAAGCGGC
GCCGACGCCT TCTTCACCAC CGCCCACGGC AGCGGCAGGA CCATGAGCAG ACATGAGGCG
AAGAAAAATT TCAGGGGCGA CAAGCTGCAG CGTGAAATGG AGGCGCGGGG GATCTACGTC
CGCACCGACT CGTTCGGCGG GTTGGCGGAG GAAGCGGGAC CCGCATATAA GAATATAGAC
GAGGTCGTTG AAGCCACCGA ACTGGCCGGC TTGAGCAAGA GGGTGGCGCG CCTGGTTCCG
ATCGGCAACA TCAAGGGGTA G
 
Protein sequence
MNVPAALKRI TDQLWELPVS YKEGMLVPAR IFASEKLVRE MDAGVFEQVS NVATLPGIQR 
YAYCMPDGHW GYGFPIGGVA AMDPGTGVIS PGGIGFDINC GMRLVLTNLT ADQVIPKLHQ
LVDRLFARIP TGVGCHGFVK LKQDDFRSIV QQGSRWCLKN GFATQEDLDM TEEGGCFSGA
DASHISDKAV ERGYNQLGTL GSGNHYCEIQ VVKPENVMDA ELAAAFGLTM VPNQVVIMFH
CGSRGFGHQV ATDYLKLFLS VMGRKYGIKI VDRELACAPF HSPEGQAYFS AMKCAVNMAF
ANRQVILHRI REVFSDLFHA SPDELGLRMV YDVAHNTAKL ERHEVNGTRK ELLVHRKGST
RAFGPGAAGL PGCYAKTGQP VIIGGSMETG SYLLAGMQSG ADAFFTTAHG SGRTMSRHEA
KKNFRGDKLQ REMEARGIYV RTDSFGGLAE EAGPAYKNID EVVEATELAG LSKRVARLVP
IGNIKG