Gene GM21_0919 details


Gene Information

Locus tag: GM21_0919
Symbol:
ID: 8136240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1095071
End bp: 1096750
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 63%
IMG OID: 644868535
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003020744
Protein GI: 253699555
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
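
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1095071
end_bp = 1096750

# Assumption: coordinates are 1-based and inclusive,
# so the span covers both the start and the end position.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1680 bp) and Protein Length (559 aa).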


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 6.4473e-32
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAATGC CCCTCTTCCG GCTGCTCTGT CTGGTCGCGA CCGCCATCTC CTTTCTGGTC 
ATCATCCCGA CAAACTACCT GCATCACCGC CCCGCCGAGA TAAACGCCGC CGTCTTCTGC
TTCGGGCTCA GCGCGCTCTG GCTCTTCCGC CGCGCCTGCC AGGGAAAGCA CCACCTCAAG
TCCTTCTTCT TCCTGCTGCT TCTCCTGCTG AACCTGGTCT GGTTCCCAAG CGGCGGGACC
AGCGGGAGCT CTGGCTATTT CTTTTACTGC CTCTTCCTCT ACCCCCCTAT CTTTTACCGG
GGCAAGACGC GCTGGCTGCT TCTTGTCCTC GCGGTTGCCG ACGCGGTGAT GCTGCTGGCC
GCAGAGATCG TTTTCCCGGG CTCCGTGGTC CATTACGCCG CCGCTTATGA CAGGACGGCG
GATCTGGCCG TAGGCCTCGT GATGAGCGCG TTTTGCTGCT CAATGATGTT GTGGCTGCTT
TTGGAGCAGC ACGACCGCGA GCAGCGGCGG CTTATGTCGC TGAACGAGGA GCTGCGCATG
GCCATGGACG ACCGGGCCTG CGTCGAGAGC TCCATGCTGC AAAATCGCGA GCTGCTGCAC
GCGGTGATCG AAGGGACCAC GGACGCGGTC TTCGTCAAGA ACCTGCAGGG GCAGTACCTC
CTCTTCAACC GGGCGGCCGA GGCCATGACA GGGGTAAGCG CCGGCCAGGC GCTGGGAAAC
GACGATCACG CCGTTTTCCC TCCGGATGTG GCGCAAAAGG CGATGGCGAA GGACCGATAC
GTCCTGGAGA CCCGCGAGCC TCAGACCAAC GAGGTCAATT TAGCTTCTCC CGACGGGGAG
ACGAGGATCT TCGAGGCGAT CAAGGGGCCG CTGCAGGACG GCAAAGGAAA CCTCGTCGGC
GTTTTCGGGA TCTCCAGGGA CGTCACGGAA AGGCGCCGCA TGGCTGAGGA ACTCAGGAAG
CTGAACGAGG AACTGGAGCT GCGTGTCATC GAGCGGACGG GGCGGTTGGA GGCCGCCATG
CGGGAGCAGG AGAGCTTCAG CTATTCGGTC TCGCACGACC TTCGGGGGCC GCTGCGCCAC
ATAAACAGCT ACACGGCTAT CATCGAGGAG GAGTTCGGCG CCGAGCTGCC GGCGGAGGCG
AAAAGGTACA TGGACCGCAT CCGGAATTCC AGCCGGATCA TGGGGGACCT GATCGACGAC
CTGCTGGAAC TCTCCCGGAT CGGCAGGTCC GAACTGAACA AGGTCCCCGT CAGCCTGAGC
GAGCTCGCCG GCGGCATCGG GCACCAACTG CTGGAAAGCG AGCCGGCGCG CCAGGCCGAG
CTGGTGATCG AGCCCGGCCT TAGGGTGCAT GGGGACCGGG TTCTTCTAAG GCAGCTTTTG
GAGAACCTGC TCGACAACGC CTGGAAATAT TCCAGGGGGA GGGGATGCGC CCGTATCGAG
GTGGGTAAGT CGGACTGGGG CGAGCGGGAC GCGTTCTTCG TCCGCGACAA CGGGGTGGGC
TTCGACATGA CCTACCAGGA CAAGCTGTTC GGGGCCTTCC AGCGCCTGCA CGGCTCGGAG
TTCGAGGGAA CCGGGATCGG CCTCGCCACC GTGAAGCGGA TCGTGGAGCG TCATGGCGGA
ACTGTCTGGG CGCAGGGGGA GGTCGATGCC GGGGCCACCA TCTACTTCAC CCTCTCCTGA
 
Protein sequence
MEMPLFRLLC LVATAISFLV IIPTNYLHHR PAEINAAVFC FGLSALWLFR RACQGKHHLK 
SFFFLLLLLL NLVWFPSGGT SGSSGYFFYC LFLYPPIFYR GKTRWLLLVL AVADAVMLLA
AEIVFPGSVV HYAAAYDRTA DLAVGLVMSA FCCSMMLWLL LEQHDREQRR LMSLNEELRM
AMDDRACVES SMLQNRELLH AVIEGTTDAV FVKNLQGQYL LFNRAAEAMT GVSAGQALGN
DDHAVFPPDV AQKAMAKDRY VLETREPQTN EVNLASPDGE TRIFEAIKGP LQDGKGNLVG
VFGISRDVTE RRRMAEELRK LNEELELRVI ERTGRLEAAM REQESFSYSV SHDLRGPLRH
INSYTAIIEE EFGAELPAEA KRYMDRIRNS SRIMGDLIDD LLELSRIGRS ELNKVPVSLS
ELAGGIGHQL LESEPARQAE LVIEPGLRVH GDRVLLRQLL ENLLDNAWKY SRGRGCARIE
VGKSDWGERD AFFVRDNGVG FDMTYQDKLF GAFQRLHGSE FEGTGIGLAT VKRIVERHGG
TVWAQGEVDA GATIYFTLS
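
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further analysis. The sketch below shows one possible way to do that and to compute a GC fraction; the helper names `join_blocks` and `gc_fraction` are hypothetical, and the sample input is only the first line of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Sample input: the first line of the gene sequence above.
first_line = "ATGGAAATGC CCCTCTTCCG GCTGCTCTGT CTGGTCGCGA CCGCCATCTC CTTTCTGGTC"

seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to `join_blocks` in the same way should give a string whose length can be compared against the Gene Length field.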