Gene GM21_0292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0292 
Symbol 
ID8135599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp361230 
End bp362474 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content59% 
IMG OID644867912 
ProductHipA N-terminal domain protein 
Protein accessionYP_003020134 
Protein GI253698945 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones182 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAGGA CGCTGCAGGT CTATTGGGAA AGCCGGCGGG TAGGGGAATT GACCCAGGAA 
GACGGGACCC TTACCTTTAG CTACGATGGC GACTACCGCT CTTCGCCCGG TGCCCAGCCG
CTGTCGCGGC AGCTCCCGCT CACTAGCGCG GAGTTTGCCA ATGCAGCGGC CAGTGCCTTC
TTTTCGAACC TCTTACCCGA GGGAGGAATC CGGCGACGGG TGGCCCGACA ATTAGGAGTC
TCGGCTGAAA ACACATTCGG ACTCCTGGAA GGAATTGGAG GGGACTGCGC GGGGGCAGTT
TCGGTGCTGC GACCGGGAGA GGTTCCTCTT CAGAGTGGCA GGTACCGTCC CATTTCTACC
GACGAGCTCG GACGTGAGCT GGCCTCGCTA CCGTCGCATC CCTTCCTTGC CGGGGAGGAG
GGAGTGCGGC TCTCCCTGGC CGGAGCCCAA AACAAGCTTC CGCTCTTTGT TGACCAAGAC
GCATACTTCA TCCCAGAGGG TAACCTTCCC TCCTCGCACA TCCTCAAGAT AGCGATCGAC
AAGCTGGAGG ATACGGTGAC GAACGAGGCG TTCTGCATGA CACTGGCGCG TCGGGTAGGA
CTCTTGGTGC CGGAAGCGCG TGTCGTCGAA ATCGCTGGGG AAAAGGTCTA TCTGGTGGAA
CGTTATGACC GTGTCCGGAC TGCTTCCGGC AGCGTGGAGC GGCTGCACCA GGAGGATTTC
TGTCAGGCAC TGGGCGTCTT TCCTGAGTTA AAGTACGAAC AGGAGGGAGG CCCCGGCTTT
GCGCAATGCT TCAGCTTGGT GGGGGGCTGG AGCGTGGAGC CGATACTGGA CACGCTGAGC
CTGCTCCGAT GGGCACTTTT TAATTTTCTA ATTGGGAATG CGGATTCCCA CGCCAAGAAC
CTCTCCTTTC TCTACCATGC CGGAAGCGTC CGGCTGGCCC CTTTTTATGA CCTGCTCAGC
ACCGCGGTCT ACGAGCGGGT CAACAACAAG TTCGCAATGA AGATGGGAGG GCAGAAGGAT
CCCCGATATC TCATGCCGCA GGATCTTGCC GCCTTCGCCA AAGAGGTGGG AATCGGCCTG
CGCACGGTGA AAGGGCAGTT GGCGGAACTG TGCCAAAAAG TGACTGATGA GATCGCGCCT
CTGGCACAAA CGTATCGCGA CAGGTATCAA GATCCTCCCA TCGTAGCAGA CATCCTCCGC
GTGGTTGATC AACGCATCCG CAAAGCCCGA ACCCTCGCCT CCTGA
 
Protein sequence
MRRTLQVYWE SRRVGELTQE DGTLTFSYDG DYRSSPGAQP LSRQLPLTSA EFANAAASAF 
FSNLLPEGGI RRRVARQLGV SAENTFGLLE GIGGDCAGAV SVLRPGEVPL QSGRYRPIST
DELGRELASL PSHPFLAGEE GVRLSLAGAQ NKLPLFVDQD AYFIPEGNLP SSHILKIAID
KLEDTVTNEA FCMTLARRVG LLVPEARVVE IAGEKVYLVE RYDRVRTASG SVERLHQEDF
CQALGVFPEL KYEQEGGPGF AQCFSLVGGW SVEPILDTLS LLRWALFNFL IGNADSHAKN
LSFLYHAGSV RLAPFYDLLS TAVYERVNNK FAMKMGGQKD PRYLMPQDLA AFAKEVGIGL
RTVKGQLAEL CQKVTDEIAP LAQTYRDRYQ DPPIVADILR VVDQRIRKAR TLAS