Gene GM21_3361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3361 
Symbol 
ID8138728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3889846 
End bp3892116 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content61% 
IMG OID644870979 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003023144 
Protein GI253701955 
COG category[T] Signal transduction mechanisms 
COG ID[COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATTT CAGCCAAAAA AGGGGGAACG CCCCCGCAAT ACCAGGGGCC GGAACTCTCC 
GCCGGTGAAG TGAGGAAAAG AAAGCGCGAG GCGATAATCG TCTGTATCTC GCTTTTAACC
ATCTGCCTCC TCACCTATCT GGAGATCCAC CTCTCCCGCC TAAGCGAGCA GGTGCCGATG
GGCAGCAACA TCGCCATCTT CGGCATGGTC AACCTGATCA TCCTCCTCAT CATCCTGCTG
GTCTACCTGG TCTTCAGAAA CATCGCCAAG CTGGTCCTGG AACGACGCAA GAACACGCCG
GGGGCGAAGC TGCGCACCAA GCTGGTGCTT GCCTTCGTCA CCCTCTCGCT GCTCCCGACC
ATGCTGCTGT TCTTCGTCTC CGCCGGCTTC ATCAAGAACA GCATCTCGAA CTGGTTCAAC
AAGCAGGTGG AGACCTCGCT CAACGAGTCG ATGGAGGTGG CCCAGGTCTA TTACAAGACC
TCTGCGGCCA ACGCCCTCTA CTACGGCGAG CAGATCAGCA CCGCCATCAA GGAACGGAAG
CTTCTGAACG AGGAGAACCT CCCCGAGCTG AAGGCTCTGG TGCGCCAGAA ACAGACCGAA
TACAACCTGG GGGTGGTCGA GGTCTTCTCG GCGCAGCGCG AGGAGCTGTT CCGGGCCGCG
AACGCCAAGC TGCCGCTGGG GGAATTCACC AACCCCTCGT CGGAGGATAT CCAACGCGTC
CTCTCCGGGG CCATGCTGAC CCGCGTCAAC GCCATCGGCA AGGCGGACCT GATCCGCGGC
ATCGTCCCCA TCCGCAGCAA CTTCAACGAA AAAGACGTGG TCGGGGTGGT GGTTGTCAAC
TACTACGTCC CCTACTCGCT GGTGTCCAAG ATGCGGGAGA TATCCGCCTC CTATCAGGAG
TTCCGCCAGC TGAAGATCCT GAAGAACCCG ATCAGGACCG GTTACATACT CACCCTGTTC
CTGATTACCA TGGTGATCCT CTTCCTGGCC GTATGGTTCG GGGTGTACCT TGCCCGAAGC
CTCACCATCC CGATCCAGGA ACTGGCCGAG GCGACCCGGC AGGTGGCCGA GGGGAACCTG
GACGTGCATC TGGGGGAGAG CGGGGGGGAC GAGATCGGCA TGCTGATTAC CTCCTTCAAC
CGGATGACCG AGGACCTCCG GGCGAACCAG CTCGCGCTGC AGCACACCAA CGAGGAACTG
CAAAAGAGCA ACCTCGAGCT GGAACAGCGC CGCCGCTACA TGGAGGCGGT GCTCGCCAAC
GTCACCGCCG GCATCATCTC GGTGGACAAA AACGGCCTGC TCACCACGGT CAACAAATCT
GCGGAAAAGC TCCTCCTCAT CAACACGGAC AAGGTCACCG GGCAGAACTT CCGCGAGGTG
CTGCACCCCG AGCACCTGGA CATCGTCAAG GGGCTCTTGC GGGACATGGT GCTTGCCAAG
CACGACTCCA TCGTGCGGCA GGTGGTGATC CCGATGCGCG ACGCGGAGCT CACCCTGCTC
ACCAACCTCA CCGTCCTGAA GGACGAAAAC GACTCCTTCA TGGGGATGGT GGTGGTGCTG
GACGACATGA CCTCGCTGAT CAAGGCGCAG CGCATGGCCG CCTGGCGCGA GGTGGCCCGC
AGGATCGCCC ACGAGATCAA GAACCCGCTC ACCCCGATCC AGCTCTCCGC CCAGCGGCTG
AGGAAGCGCT ACCTCACCCG CTTCGAGGGG GAGGAGGAGG TGTTCGACCA GTGCACCGCC
ATGATCATCA AGTCCGTGGA CGAGCTGAAG GGGCTGGTGA ACGAATTCTC CAACTTCGCC
CGGATGCCGG CCGCGGTCCT GAAACCAAAC GACCTGAACG GGATACTCAA GGAGGCGCTC
ACCCTCTACG ACGAGGCGCA CCGGCACATA CACTTCGTGT TGAACGCCGA CGAGGAACTC
CCCCCGATCC TTTTGGACCG CGACCAGATC AAGCGGGTAG TGATCAACCT CTTGGACAAC
GCCGTCGCCG CCATAGAGGG GGATGGAGAG GGTGTGGTCG AACTCAGCAC CAGCTACGAC
AGCCAGCTGA AGATGGTCAC TTTCACCGTT TCCGACACCG GCCACGGCAT ATCCGCCGAG
GACCGCCCGC GGCTCTTCGA GCCGTACTTC TCCCGGAAAA AGAGCGGCAC GGGGCTTGGG
CTCGCCATCG TCAACACCAT CATCACCGAC CACCACGGCT TCATCAGGGC CAAGGAAAAC
TACCCCAAGG GGAGCAGGTT CGTCATCGAG CTCCCCGCTG ACGCGGCGTA G
 
Protein sequence
MPISAKKGGT PPQYQGPELS AGEVRKRKRE AIIVCISLLT ICLLTYLEIH LSRLSEQVPM 
GSNIAIFGMV NLIILLIILL VYLVFRNIAK LVLERRKNTP GAKLRTKLVL AFVTLSLLPT
MLLFFVSAGF IKNSISNWFN KQVETSLNES MEVAQVYYKT SAANALYYGE QISTAIKERK
LLNEENLPEL KALVRQKQTE YNLGVVEVFS AQREELFRAA NAKLPLGEFT NPSSEDIQRV
LSGAMLTRVN AIGKADLIRG IVPIRSNFNE KDVVGVVVVN YYVPYSLVSK MREISASYQE
FRQLKILKNP IRTGYILTLF LITMVILFLA VWFGVYLARS LTIPIQELAE ATRQVAEGNL
DVHLGESGGD EIGMLITSFN RMTEDLRANQ LALQHTNEEL QKSNLELEQR RRYMEAVLAN
VTAGIISVDK NGLLTTVNKS AEKLLLINTD KVTGQNFREV LHPEHLDIVK GLLRDMVLAK
HDSIVRQVVI PMRDAELTLL TNLTVLKDEN DSFMGMVVVL DDMTSLIKAQ RMAAWREVAR
RIAHEIKNPL TPIQLSAQRL RKRYLTRFEG EEEVFDQCTA MIIKSVDELK GLVNEFSNFA
RMPAAVLKPN DLNGILKEAL TLYDEAHRHI HFVLNADEEL PPILLDRDQI KRVVINLLDN
AVAAIEGDGE GVVELSTSYD SQLKMVTFTV SDTGHGISAE DRPRLFEPYF SRKKSGTGLG
LAIVNTIITD HHGFIRAKEN YPKGSRFVIE LPADAA