Gene GM21_4044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4044 
Symbol 
ID8139418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4626920 
End bp4628119 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content60% 
IMG OID644871660 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003023818 
Protein GI253702629 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.571618 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCC CCGCAGATCT CCCCATCCGC GTCAAATTCT TCGGACTGAT GTCGCTACTT 
TTAATCGCCC TCTTGCTGGC AAGCGGCCTC TTCATCTACA ACCGGCAGAA AGAGTTCGTC
GTCAGGTTCG CCGTCGACAA CGCCCGCAGC TTCGCCACCA CCGTAATAGA GACCCGCGAG
TACATGTCTT CCGTGGTCAG GGACGAGCCC GAACAAAACT ACAACCTGGT CCCCCAGGTA
GTGGCCACCC AGGTCGCAAA GAGGGTCACC CAAAACAGCA AGTTCTACCT GCGCCAGGTC
TCGCTGCGCT ACCGCAATCC CAGCAACAAA CCGGACGCCT ACGAGACGAA GCAACTGCAG
TACTTCATCA ACAATCCCAA CGCCGAGGTC TACAGCATCG TGCAAAGCGG CGATATCAGC
CTCTTCCGTT ACCTGCAGCC GATGCGCGCC ACCGCCTCCT GCCTCGAATG CCACGGCAGC
TATGAAACCG CACCCGATTT CGTGAAGAAG CGCTTCCCCC CAGGCCACTA TTCCTACAAC
TACAAGGTGG GCGAGGTGAT CGGGGCGGTC TCGGTCAGCA TCCCGGTCAA GGACCTCTAC
GCCCAACTGG GCGCTAACCT CAAACTCGAC CTCCTTTTCC GGGCTATGGT CTACGTGATC
GTCATCCTGG TGATGGGATT CATCATGAGC CGCCAGATCC TCAATCCCAT CAAGCTCCTC
TCCGAACGCA TGATCGCCGT GACCCGCACC GGCAACTTCA AAGACAAGCT GCCGCAGAAG
ACCAACGACG AGATCGGCAT GCTGATCGGC TCCTTCAACG AGATGATGGA CGAACTCTCC
AGCCGCACCG TCCAGTCGAA AGAGGCGGAC GAGCGCTACC GCCGCTTCAT CGAGGTGGCC
GCCTCGGCGG TGATCACCTT CCTCAAGGAC GGCAAGATCG TCATCGCCAA CCAGAAAGCC
GAGTCCCTCT TCGGGCGCTC GCGGCAGGAA CTGCTGGGGG AATCGATCTT CAGCTTTCTG
GAGGATGGGG CAGCGCTCAA GGATAGGCTT TCCACGCAGA CGGAGTTCCG GGACGAAGCG
TCCCGCCAGA TAGTGAACGG CAGCGGCGGA AAACGGACGG AGGTGGAGAT GGTGCTCTCC
GTTTCCAGGA CGGACCGGGA GCCGATGTTC ACCGCCATCC TCAGGGAGCG CAGGGGATAA
 
Protein sequence
MTRPADLPIR VKFFGLMSLL LIALLLASGL FIYNRQKEFV VRFAVDNARS FATTVIETRE 
YMSSVVRDEP EQNYNLVPQV VATQVAKRVT QNSKFYLRQV SLRYRNPSNK PDAYETKQLQ
YFINNPNAEV YSIVQSGDIS LFRYLQPMRA TASCLECHGS YETAPDFVKK RFPPGHYSYN
YKVGEVIGAV SVSIPVKDLY AQLGANLKLD LLFRAMVYVI VILVMGFIMS RQILNPIKLL
SERMIAVTRT GNFKDKLPQK TNDEIGMLIG SFNEMMDELS SRTVQSKEAD ERYRRFIEVA
ASAVITFLKD GKIVIANQKA ESLFGRSRQE LLGESIFSFL EDGAALKDRL STQTEFRDEA
SRQIVNGSGG KRTEVEMVLS VSRTDREPMF TAILRERRG