Gene Gura_4254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4254 
Symbol 
ID5165921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4916581 
End bp4917804 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content50% 
IMG OID640551732 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001232970 
Protein GI148266264 
COG category[T] Signal transduction mechanisms 
COG ID[COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.157553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTGGT TTGCCACCAT GAAATTGAGC GCCAAGTTCA ACCTGATCAT GTCCAGCCTG 
TTAATCGTCT TGTTTCTTGC CGCCGCGTTC TTTACCTATA AGCGCGAGCA GTTGCTAATT
ATGAAGGTGG CGGTCGATAA CGCGCGTCAC ATTGCCAAGC AGATAATTGA AACCCGGGAC
TACATCTCCA GCGTGGTGCG GGGCGAGCCG GAAGGGAATT ATGCCCTTGT CCCGCAGGTG
GTGGCTACAC AGGTAGCAAA AAGAATGACC ACCGGCAGTA AATATTACGT GCGCCAGGTT
TCATTGCGCT ATCGCAATCC TGAAAACCGC CCCGATGATT ACGAAACGGA ACAGTTAAAG
AAATTTGCCG GTAAAGCCAT CAGAGAGTCG TATTCGGTAG TCGAGGTTAA GGGCGAGCAG
TCTTTTCGTT ATATGCAGTC AATGGTGGCG GAAAAATCAT GCCTTGAGTG CCACGGAACC
TACGATCAGG CCCCCTTGTT CATACGCAAC CGTTTCCCGC GCGGCCATTA TTCTTACAAC
TATAAACTAG GTGAGGTCAT CGGGGCCGTT TCGGTGACCA TTCCAATGGC CGAGCTGTAT
CGTGAAATAG GTACAAACCT GAAGGTTGAC CTGATATACC GCGGAGGTAT ATTTTTTGTC
ATTATTGTGA TAATGGGGGC CTTGATTAGG CGGAACATCA TCAATCCGAT CAAGATGCTG
TCGGAGAGCA TCACCCAGGT AACGAGAACC GGCAGCTTTG CCGATCGACT GCCGAAGAAG
TCGGATGACG AAATCGGCCA GCTCATTAAT TCATTTAACG AAATGATGGC GGAGCTGGAG
CGCAAAATAG AGCAAAGCAG GGAATCCGAA GAACGTTACC GTAAATTCAT TGAGATTGCC
AAGTCTGCGG TTGTCACCTT CATGCATGAC GGGAAAATTG TCATTGCGAA TCAGAAGGCC
GAGGAACTAT TCGGTCTTCC CCGCCAGGAA CTGTTGGGGG AAATCGTCTA TAATTTCTTC
GAGAACAGCG AAATGCTGAG GGAAGAAGTT TCCGATTATC TGCGAACCGG CGAGGAGAGG
AAAGGCGCTG CTCGGACAAC CATGCAAAAG GTGCGTGATG TCAAAGGCGT TTCAAGGGAG
GTAGAAGTGG CCCTTTCAGC GACCCAGACG GAGCATAGGC CGATGATAAC GGCGATCTTG
AGGGAACTCA CCAGCAATAA ATGA
 
Protein sequence
MSWFATMKLS AKFNLIMSSL LIVLFLAAAF FTYKREQLLI MKVAVDNARH IAKQIIETRD 
YISSVVRGEP EGNYALVPQV VATQVAKRMT TGSKYYVRQV SLRYRNPENR PDDYETEQLK
KFAGKAIRES YSVVEVKGEQ SFRYMQSMVA EKSCLECHGT YDQAPLFIRN RFPRGHYSYN
YKLGEVIGAV SVTIPMAELY REIGTNLKVD LIYRGGIFFV IIVIMGALIR RNIINPIKML
SESITQVTRT GSFADRLPKK SDDEIGQLIN SFNEMMAELE RKIEQSRESE ERYRKFIEIA
KSAVVTFMHD GKIVIANQKA EELFGLPRQE LLGEIVYNFF ENSEMLREEV SDYLRTGEER
KGAARTTMQK VRDVKGVSRE VEVALSATQT EHRPMITAIL RELTSNK