Gene GM21_0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0419 
Symbol 
ID8135728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp492371 
End bp493501 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content61% 
IMG OID644868037 
Productzinc finger SWIM domain protein 
Protein accessionYP_003020257 
Protein GI253699068 
COG category[S] Function unknown 
COG ID[COG4279] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones97 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTT TTTACGAATG GTATCCCCCT TACGTTTCCG TGGCGGAGAG GCGCGCCAAT 
GCGAAAGCCG AGATGGAGAA GTTGCGCAAG AAGAAGGGGG TGAACGTCCA GCCGGTGGAG
ATCTCCGGGC GCACCATCGC CTCATCCTTC TGGGGTAAGG GGTGGTGCGA TCATATAGAG
TCCTTCCACG ACTACGCCAA CCGCCTGCCG CGCGGACGCT CCTACGTGAG AAACGGCTCG
GTCTGCCACC TGGAAATAAA GCCGGGAAGC ATCGAGGCGC TGGTGAGCGG TTCGATGCTT
TACAACGTCG CCATCACCAT CGCCCCGATC TCGCAGGCTA AATGGAACGC CGTCAAGGCC
GCCTGCGCGG GCCAGATAGG CTCCCTCATC GACCTGTTGC GCGGCAGACT CGCCAGCGGC
GTCATGGAAG TGGTGTCCCA TCGGAGCACC GGCCTCTTTC CACTGCACAA AGAGATCCGC
TTCAGCTGCG ATTGCCCCGA TTCCGCCAAG ATGTGCAAGC ACATAGCAGC CGTTCTTTAC
GGAGTGGGGG CGCGTCTGGA TCACGCGCCG GAGAAGCTTT TCCATCTGAG AGGCGTGAAC
CACGAAGAGA TGGTGGACGT GGCGAGCACG ATAGGTGTGG CGACCGGTGC GGGGAGTTCC
CGGCGCCGGT TGGCAGCGAC AAGTCTGGAC GACATCTTCG GCATCGACCT GGCGGGGGGC
GGATCGGAGA GCGCAGACGC GGCAGAGGCC AAGGATGCGC CGATTCCGAA GGCGAAAAAA
CCCGTGGCGG CCCGTCCTGC CACCGCAAAA AAAGAAGCGA AGACAGAGGC GCAAAAAGCG
ACACAGATGG GAGCGGCGCT ACCGGTTAAG GAGGTAAAAG TACGCGCGAA GGTAGTAGTT
GAGACACCTC TTGTGGCGCC CACCACGTCG ACACCGTTTC CAAGACGTCT CACCGGGAAA
GTAATCCTTA CCTGGCGCAG TTCCCTGCGA GAGACCCAGG CGGAGTTCGC CTCACGGATC
GGCGTTTCCG CCGGATGTAT CTCGCAGTGG GAGAAAAAGC TGAGACAGAC CCTTCAGGTG
AGGGAGCGCG CGTTGGCTGC GCTGCAAAAG GCATGGGTCG ACACTCATTA G
 
Protein sequence
MSRFYEWYPP YVSVAERRAN AKAEMEKLRK KKGVNVQPVE ISGRTIASSF WGKGWCDHIE 
SFHDYANRLP RGRSYVRNGS VCHLEIKPGS IEALVSGSML YNVAITIAPI SQAKWNAVKA
ACAGQIGSLI DLLRGRLASG VMEVVSHRST GLFPLHKEIR FSCDCPDSAK MCKHIAAVLY
GVGARLDHAP EKLFHLRGVN HEEMVDVAST IGVATGAGSS RRRLAATSLD DIFGIDLAGG
GSESADAAEA KDAPIPKAKK PVAARPATAK KEAKTEAQKA TQMGAALPVK EVKVRAKVVV
ETPLVAPTTS TPFPRRLTGK VILTWRSSLR ETQAEFASRI GVSAGCISQW EKKLRQTLQV
RERALAALQK AWVDTH