Gene Gura_4221 details


Gene Information

Locus tag: Gura_4221
Symbol:
ID: 5164870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4880527
End bp: 4881606
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 52%
IMG OID: 640551699
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_001232937
Protein GI: 148266231
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
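
The start, end, gene length, and protein length fields above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4880527
end_bp = 4881606

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1080 359
```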


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAAA GCGGATTGAC ACATACGGGA TCTCCAAAAG AGGACGCTCT TTCTCTTTCC 
AATAGGTCTG CTATTGTCGG CATGCTTATG GACAGCGTAG GCATAGGCGT GATCTTTGTC
GAGACCAGCG GACGACTTAC ACTCATCAAC CGCAAGGCCG AGGCAATACT GCAAGCCTCC
GGCAGTTCCG TACTGGGCAA AAGGGTTGAT ATGCTGCCGC TGCGCACCGC TATTTACAAG
GTTCTGAGTG AAGATTGCAG CGAGACTCCT GTTGAGATGA GCATTGACGG TGCGGTAATC
ACCGTCAAAT CATCCGAATT ATATGCTCCT GACGGCGAAA TACTGGGAGA GATGTTCGAG
TTGCGCGATG TTACCGAAGA TAAGAAAGAA AAGAGGCAGC GCGAAGAGAT AGTTGCCATG
ATGACCCACG ATCTCAAGTC TCCATTGACG GTTTTGATGG GATACGTCCA GACGCTGAAG
GGGGAAATGC CGCAAAAGAT CGACATTTCG CTTCAGCCTT GTCTGAAGGA GATGGACAGG
AGCGCTCTAA AGCTTCTTGC CATGATAGAG GACGTCTTGG ACGCTTACCG GCTGGAGGTG
GGTCTCCTGC AGATTAATTG TGCCGTCTGT GATATCGGCG CACTGCTTGA TGGGTGCTGC
TGTGACGGAT TACGTGAAGC CCAGGCGCGC GGTTCGAATC TTACCTGTAA CATCAGTGAG
GGGATTCCTC CTCTCAAGGT CGATGGCAAG CAGCTTTCAC GGGTCTTTGC CAACCTTATC
GGCAATGCGT TGAAGTTTAC CCCTCGCCGC GGCTCAGTCA CGGTGACTGC TGAAGTGCGG
GAGGATAAGG TTTTTGTTTC CGTTAAAGAT ACCGGGATCG GGATCCCGCA GAAAGATGTG
CCGCGGATCT TTAACAAGTA TTTTCGATCC TCTGCCGCTA CCGGCTTCAA AGGGACCGGC
CTTGGCCTGA CCATCAGTAA AGCTATTGTG GAAGCTCACA GCGGTACGAT CGAAGTTGAA
AGTGTGGAGG GCGAAGGCAG CTGCTTTTCG GTCATCATTC CTCTGGGAGC CTGTCATTGA
 
Protein sequence
MEKSGLTHTG SPKEDALSLS NRSAIVGMLM DSVGIGVIFV ETSGRLTLIN RKAEAILQAS 
GSSVLGKRVD MLPLRTAIYK VLSEDCSETP VEMSIDGAVI TVKSSELYAP DGEILGEMFE
LRDVTEDKKE KRQREEIVAM MTHDLKSPLT VLMGYVQTLK GEMPQKIDIS LQPCLKEMDR
SALKLLAMIE DVLDAYRLEV GLLQINCAVC DIGALLDGCC CDGLREAQAR GSNLTCNISE
GIPPLKVDGK QLSRVFANLI GNALKFTPRR GSVTVTAEVR EDKVFVSVKD TGIGIPQKDV
PRIFNKYFRS SAATGFKGTG LGLTISKAIV EAHSGTIEVE SVEGEGSCFS VIIPLGACH
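
As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below strips the 10-base block spacing, computes the G+C fraction, and translates codons until the first stop. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard one, which is assumed to be adequate here because translation table 11 uses the same codon-to-amino-acid assignments and differs only in its permitted start codons.

```python
# Standard codon table in compact form: codons are ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGAAAAAA GCGGATTGAC ACATACGGGA"
print(translate(first_blocks))  # MEKSGLTHTG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would be the natural way to compare against the listed GC content and protein sequence.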