Gene GM21_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1079 
Symbol 
ID8136401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1265448 
End bp1266602 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content63% 
IMG OID644868690 
ProductHpt protein 
Protein accessionYP_003020898 
Protein GI253699709 
COG category[T] Signal transduction mechanisms 
COG ID[COG2198] FOG: HPt domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value3.57713e-26 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGCGAC GGTTCCGTGA CCTCTCCATC AGGCGTAAGC TGCTCGCCAT ACTGCTTCTT 
ACCAGCGCAG TAGTGCTCTC CCTCGTTTCC ACCGCTTTCG TCATCACCGA GGCGACCGGG
TTCCGTAGCG GCATGCAGAC CGAGCTGAGT GCGCTGGCGG AGATCGTGGG AAGCAACAGC
TCAGCCGCGG TAGCCTTCAA CGACCGCAAA TCGGCGGCCG ATACCCTGGC CGCGCTGCGC
GCCAAACCGT ACATACTGAC CGCGCTGGTC GTGCTGAAGG ACCACTCACT CTTCGCAAGC
TATGTAGCGC CGGGCGCCAC GCTGCGGGAT CTAGGCTTTA TCGACGGTTC CGGCGAGAGC
GCGCGTGTGG ACGACCGGAA GTTGAGGGTC GAGTCGGCCC GCGCCAGCTT CCCGCTTGCC
TTAGGCGACC ACATCTTCGG CATCTCCCCC ATCATCCTGG ACGGCCAACA GTTGGGAACC
GTGGTGGTTC TGTCCGATTC CACGGCATTA AAGCACCGGT TGAAACCGTT CTTCCTCATG
CTGGCGGGGG TGCTGCTGGG CGCGCTGTCG CTGGTGTATT TTCTAGCCGC GAAGCTGCAA
CGCATCATCT CCGAACCCGA CTCGCACCTG GCGCAGGTCA TGAAAGCGGT CTCCACCGAC
AAGAGCTACA ACCTCAGGGC GCGAAATCAG CAGGGGAACG ATGAACTGGG GACGCTTATC
GACGGCTGCA ACGAGATGCT GAGCGGTGCC GCACCCACGG CCGAAGCGGC CGCTTCCCCC
GTGGAAACCG CACCTGCCGG GGAGGGGGGT GATACTTCCC CGCCGCCGGT GTTCGATCGG
GCGGGCCTGC TTTACCGGGT GGGAGACCCC GAGTTCATTG GCGTGTTCGT GGAGAAGTAC
CTGGCTAGCA CGGAGCAGTT GCTGGGGCTT TTGAGACAGG CCATAGCGGA CGGGGATCAA
GACGGCATGC ACCTGCATTC CCACAGCATC AAGGGGGCCG CGGCCAGCAT AGGTGCCGAG
GTGATGCGGA GCATTGCGTT CGAGATGGAG AAAAAGGGAG CGCAGCAAGA AGACGTTGAG
GGGATGACGA GGCTTTACCA GGATCTCGAG GAGGCGTTCG ACGAGTTCAG GAGGGAAGCG
GCGCAGCCTG AGTGA
 
Protein sequence
MLRRFRDLSI RRKLLAILLL TSAVVLSLVS TAFVITEATG FRSGMQTELS ALAEIVGSNS 
SAAVAFNDRK SAADTLAALR AKPYILTALV VLKDHSLFAS YVAPGATLRD LGFIDGSGES
ARVDDRKLRV ESARASFPLA LGDHIFGISP IILDGQQLGT VVVLSDSTAL KHRLKPFFLM
LAGVLLGALS LVYFLAAKLQ RIISEPDSHL AQVMKAVSTD KSYNLRARNQ QGNDELGTLI
DGCNEMLSGA APTAEAAASP VETAPAGEGG DTSPPPVFDR AGLLYRVGDP EFIGVFVEKY
LASTEQLLGL LRQAIADGDQ DGMHLHSHSI KGAAASIGAE VMRSIAFEME KKGAQQEDVE
GMTRLYQDLE EAFDEFRREA AQPE