Gene GM21_0549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0549 
Symbol 
ID8135860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp671045 
End bp673183 
Gene Length2139 bp 
Protein Length712 aa 
Translation table11 
GC content62% 
IMG OID644868162 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003020381 
Protein GI253699192 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.338238 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTAA TTAACCCCAA TGGAAATGAC CACCCCCCCG GTCCTTCCGA GAATAGCGTT 
ACCGCGACCG GTGCCGACGA GCCGGCCGGC AGGAAGGGAC GCAGGAACCT GAACAGGTGG
CTCCTGGCCG GGCTCCTTCC GCTTGCAGCT TTGGCGCTGC AATGGCTTTT ATGGGAAAGA
CTGCAGCCCT ACGTCTGGCT ATTCTTCTAC CCTGCCGTTT TCTTAAGTTC CTGGGTGGGA
GGTAGGGTCG CAGGGCTTTT GGCTACGGCA TTTTCGGCTG TCGTCGTGTG GTATCTCTTC
ATCCCCCCCC GCTACAGTTT CTCTCTGGTT CACCCCTCGA CGATCATCGC GCTGCTCATC
TTCGCGGGCA TGGGCGTCCT ATTCTCCCTG TTCCACGAAC GGCTGAGAAG AGCGGGCCGG
CAGACCCGGC TGGCGCTGGC GGAGGCTATC TTGGGACGGG AACACCTTGA GCATCTGATG
GAGGAGCGGA CACGCGAACT GGTGAGCCTG GTCGACGAGT TGCGGCGCAA GGAGGCGGAC
CTACGGAGGT CCCAGGAGCT GGCCAAGATC GGCAGCTGGA CCTACCAGGC AGACGGGAGG
CTTGAGTGGT CGGACGAGCT TTACCGCATC TACGGCCTGG CTCGCGGGAA ATTCACACCG
GACGTTCCAT CCTTTCTTGA GATCATCCAC CCGGATGACC GCGCTCACAT GCAGCATTGG
GTTAAGGCCT GCCTGGGAGG GGAGAACCCC GGCGAGTTGG AGTTCCGGAT CGTTCGTCCC
GACGGGAGCG TGCGCTTCAT CAGCGGCCAC GGCGCTCTGA GCCGCGATAG CGAGGGGCGA
GTCACCGGCA TGTCGGGGAC CGGCCAGGAC ATAACCGAGC GGAAAATCGC TGAGTCCGCG
CACCGGGAGA GCGAAGAGCT GCTGAAGCTT TTCATCGAAT ACGCGCCGGT GCCGCTTGCC
ATGTTCGACC GCGAGATGCG TTACCTCTAC GCGAGCCGGC GCTGGCGCAG CGATTTCGGG
TTGGGAGACC GTTCCCTCGT CAAGGTCAGT CACTACGCCA TATTCCCGGA GATCCCGCAG
CACTGGAGGG AGCTGCACCG GCGCGGACTC GCCGGAGAGA TCCTGCGCGA GGAGGCCGAA
GAATTCCGGC GCGCCGACGG CACGGTCCAG TGGCTGCGCT GGGAACTCCG CCCCTGGTAC
GACGCCGGGG GGAAGGTGGG GGGAATCGTC ATCTTCAGCG AGGACATCTC CAACAGAAAA
TGCGCCGAGG ACGCGCTGCA AAAACTCAAC GAGGAGCTGG AACTGCGCGT GGCCCAGCGG
ACCGAGACGC TGGACGTGAT CTTAAGCGAG CGGGAAGCGC AGAATGCGGA GCTGCAGCGC
GCCTATCACG AGCTCGAGGC GGAGACGGCG CGGCGTATCC GCATGGTGGA GGAACTGCGG
CAAAAGGAGC AGTTGCTGAT CCATCAAAGC CGGCTCGCGG CCATGGGGGA GATGCTCGGT
TACATCGCGC ACCAGTGGCG CCAGCCGCTG AACGTCCTCG GGCTGCACCT GCAGGTGCTG
GGGCTTTCCT ACCAGCACGG GACCTTCAGC CGGGAACTCC TGGAGGAGAG CGTGGGAAAA
GCGATGGGCA TCATAAGGCA CCTCTCCAGG ACCATCGACG ACTTCCGCGA CTTCCTGATC
CTCAACAAGG AGAAGACCCT GTTCCAGGTC GACGAGGTGA TCGTAAAGAC GGTCGGGCTC
ATCGAGGAGC ATCTCAAGAA GGCGGGGGTC CGCATCGAGG TCGCCTGCAC CAACCCGCCG
GAGGTGAACG GCTTTCCCAA TGAATACAGC CACGTGATTC TGAACCTTTT GACCAATGCG
AAGGACGCCT TTTTGGAGCG TCAGACGGAG CATCCGGTGA TCAGGGTGCA TTCGGGGTCC
GAACAGGGGA AGACGGTGGT GACCATCGCC GACAACGCCG GGGGGATCCC GGAAGAGATA
ATCGACAAGA TCTTCGACGC CTACTTCACC ACCAAGGGGT TGGGAAAAGG AAGCGGGGTC
GGCCTGTTCA TGTCCAAGAT GATCATCGAG AAAAACATGG GGGGCAGCCT CACCGTTCGC
AACGTCAACG GCGGCGCCGA ATTCAGGATC GAGATCTGA
 
Protein sequence
MQLINPNGND HPPGPSENSV TATGADEPAG RKGRRNLNRW LLAGLLPLAA LALQWLLWER 
LQPYVWLFFY PAVFLSSWVG GRVAGLLATA FSAVVVWYLF IPPRYSFSLV HPSTIIALLI
FAGMGVLFSL FHERLRRAGR QTRLALAEAI LGREHLEHLM EERTRELVSL VDELRRKEAD
LRRSQELAKI GSWTYQADGR LEWSDELYRI YGLARGKFTP DVPSFLEIIH PDDRAHMQHW
VKACLGGENP GELEFRIVRP DGSVRFISGH GALSRDSEGR VTGMSGTGQD ITERKIAESA
HRESEELLKL FIEYAPVPLA MFDREMRYLY ASRRWRSDFG LGDRSLVKVS HYAIFPEIPQ
HWRELHRRGL AGEILREEAE EFRRADGTVQ WLRWELRPWY DAGGKVGGIV IFSEDISNRK
CAEDALQKLN EELELRVAQR TETLDVILSE REAQNAELQR AYHELEAETA RRIRMVEELR
QKEQLLIHQS RLAAMGEMLG YIAHQWRQPL NVLGLHLQVL GLSYQHGTFS RELLEESVGK
AMGIIRHLSR TIDDFRDFLI LNKEKTLFQV DEVIVKTVGL IEEHLKKAGV RIEVACTNPP
EVNGFPNEYS HVILNLLTNA KDAFLERQTE HPVIRVHSGS EQGKTVVTIA DNAGGIPEEI
IDKIFDAYFT TKGLGKGSGV GLFMSKMIIE KNMGGSLTVR NVNGGAEFRI EI