Gene GM21_1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1368 
Symbol 
ID8136696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1612778 
End bp1614433 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content62% 
IMG OID644868982 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003021185 
Protein GI253699996 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.00000126937 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAGACG AGCTGACGCC GGAGCTGCGG GACGAACTAC CGAAGGCTCC AAAGGACATC 
CCCGGGCTGG ATCCGAAGGC CACACTGTTG CTGCAGGATC TGCAGAGCAG GCTGGCGACG
GCCGAGGCCG AGAATGTACG GTTGCGCCGC GTCATCGAAT CCCAGCGCTG CGACGAGGAT
GGTTACCGCT TCCTCTACAA CGACACCCCG GTCATGCTGC ACTCCATCGA CCGCAACGGG
CTGCTCCTGG GAGTGAGCAA CTATTGGGTC GAGGTGCTGG GGTACCAGCG GGAGGAGGTG
ATCGGGCGCA AGTCGACCGA TTTCCTTACC GAGGAGTCGC GCCGCTACGC CGAAGAGGTC
GTCCTGCCCG AGTTCTTCCG CACCGGTTTT TGCCGCAACG TCCATTATCA GATGGTGAAA
AAGAGCGGGG AGCTCCTGGA CGTGCTTCTT GTCGCCTCCG CGGAGCGGGG ACCCCGGGGT
GAACTGCTGC GCTCCTTTGC CGTCATGACC GACGTGACAG AATGGAAGGC TGCGGAAAAG
GCTTTGAAGG AGAGCGAAGA GCGCTACCGC ATGATAGTAG AGACCTCCCA AGAGGGGATA
CTCGCGGTCG ATGCCGAGGG GCGCATAAGC TACGCCAACC GTCAGTTCGC CGAGATGCTG
GGGCTGGAGG TCGGCGAGGT CGGCGAGGTC GGCGAGGTCG TCGGGCGTTT TTTCCTTGAG
TTCGTCGACG GCTGCCTGCA CGACGATGTA GCCGTCAAGA TCAAGAACCG GGAAAATGGG
CTATCCGAGC ATTACGAGAC GATCTTGCTG CGCAAGGGGG GCTCCAGGAT GTGGGCCGGC
GTCTCCGCCA TTCCCGTAAA AGGCCCAAAC GGCGAGTTCT CCGGGGCGTT CGCCATGGTC
TCCGACATCA CCAAGCGCAA ACAGGCTGCC GAGGAGATCG AGGTGCTGCA CACACATCTT
TCGGCGCGCG CCTGCGAACT GGAGCTTGCC AACGAGGAGT TGGAGGCTTT CAGCTACACC
GTTTCCCACG ACCTCAGAAG GCCTCTCACC GCCATAAACG GCTTCAGCCA GGTGCTGCTC
GAGCTTTACG GATCCGGTAT GGACCCGCAG TGCAGGGAGT ACGTACGGGA GATCCTAAAC
GGCAGCATCA GGATGAACCA CCTGATCGAC ACGCTGATCA ACTTCTCGCG CCGAAGCGGA
GGAGAATCGG TCCGGGAAGA GGTGGAGATA ACCGAGCTGG TGGAGGAACT TTGCGCCGAA
CTGCAACGCA CCGAGCCTCA GCGCAACGTT TCCCTGCTCA TCCAGCCAGG CGTGCGCGGG
ATGGCCGACG CGCATCTTTT GCGGGTCGTC CTCGACAACC TGCTGGGGAA CGCCTGGAAG
TACTCCGCCA AAATGGAGTC AAGCGAGATC GCATTCGGCA CGGTCGATCA CCTGGGGAAG
ACGGCCTACT TCGTCCGGGA CAACGGCGCA GGTTTCGACA TGGCCCTGGG TGACCTGCTG
TTCAAGCCGT TCCAGCGCCT CCACGACGCC CGCGATTTCG AAGGGACCGG CATCGGCCTC
GCCAGCGTGC AGCGCATCAT TCAGCGGCAC GGGGGGCAGA TCTGGGCCGA GAGCGAACCC
GGCAAAGGGG CGACCTTCTA CTTCACCCTG GGCTAG
 
Protein sequence
MQDELTPELR DELPKAPKDI PGLDPKATLL LQDLQSRLAT AEAENVRLRR VIESQRCDED 
GYRFLYNDTP VMLHSIDRNG LLLGVSNYWV EVLGYQREEV IGRKSTDFLT EESRRYAEEV
VLPEFFRTGF CRNVHYQMVK KSGELLDVLL VASAERGPRG ELLRSFAVMT DVTEWKAAEK
ALKESEERYR MIVETSQEGI LAVDAEGRIS YANRQFAEML GLEVGEVGEV GEVVGRFFLE
FVDGCLHDDV AVKIKNRENG LSEHYETILL RKGGSRMWAG VSAIPVKGPN GEFSGAFAMV
SDITKRKQAA EEIEVLHTHL SARACELELA NEELEAFSYT VSHDLRRPLT AINGFSQVLL
ELYGSGMDPQ CREYVREILN GSIRMNHLID TLINFSRRSG GESVREEVEI TELVEELCAE
LQRTEPQRNV SLLIQPGVRG MADAHLLRVV LDNLLGNAWK YSAKMESSEI AFGTVDHLGK
TAYFVRDNGA GFDMALGDLL FKPFQRLHDA RDFEGTGIGL ASVQRIIQRH GGQIWAESEP
GKGATFYFTL G