Gene Gura_1550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1550 
Symbol 
ID5162787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp1805073 
End bp1806743 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content61% 
IMG OID640549049 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001230321 
Protein GI148263615 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCGGC GACTTAGCAT TCGCAACAAA CTGGCTTTCG CGCTGTGGGG CGCGGCGCTG 
CTGGCCTTCG CCGTGGCGAG CGCGGCGTTG GCGTTCTTTG CGAGCCTAAC GCTGGAACGT
CGAGCCCGGC AGATCATGGA GCCCTATGCG CAACTGGTTT CCGTCGGGGC GGAAGCGGCG
GTGGCTTTCG AGGACCCCGG ACGGGCACGG GAAATTCTCA ATACGTTACG GGCCAATCCG
CAAATTTCGG AGGCGGAAAT AGTTTTGGGG GACGGACGGT TGCTGGCCCG CTACAGCAGC
AGGTCCAACG CAACATTCAG GCACCATCCG CTCAAACCCG ATGGCGTTTA CCTGAATCAT
AACACCGCGG AGTTGGTGCA GAGCCTGCAA GACGGCGCGC ATTTGTACCT CGCCATGAGC
CTGGACGAGC TCAACCGGCA AACTCGAAAC GTCTTGCTGG TGTTTGCCGC AGGGGTAGTC
GTCTTGCTGG TGTCCATCAC CCTGGGCCTG CTGGCAGCTC TGCAACGAAC CATCGTCCGC
CCCATCTCCA CGCTGGCCGA GACCGTCGAG CAGGTTCGCA TCCGAGGCGA CTACCACCAG
CGCGTACCGG CCTCCGGCGC CGACGAGGTG GCCCGGCTTG GCCAGAGCTT CAACGCCATG
ATGGGGGCGA TCCAGGAGCA GGACAATGAT TTGCGCCGAC TCACCCTCTT CCAGCGGACG
CTTTTGGACA GCGCAGCCCA CGGCATAATC TCTACGGCTC CCGATGGGGT TGTCAGCAGC
TTCAATCCTG CCGCAGAGCG GTTGCTGGGT TACACGGCCG ACGAAGTGGT CGACAAACAG
ACACCGGCGT GCTGGCACGA TCCGGAAGAG ATGGCGCGGC GCGCCCTTCA GTTGTCCGAA
GAACTGGGCG AAACGATCTC GCCGGGATTC GATGTGTTTG CGGCCCGCCC CCGGCGTAAC
CTGCCCGAGG AAAACGAGTG GACCTTCATT CGCAAGGACG GGGCGTGCGT GCCGGTGCAT
CTGTCCGTGA CCGCGCTGCG AGGTGAGAGC GACCGGATCA GGGGATTCGT CGGGCTGACC
TATGATCTCA CCGAGCGCAA ACGGGCGGAG GAGGAACGAC GGGAAAGCGA GGCAAAGTAC
CGCCGCATCG TAGACACGGC CATCGAGGGG ATCTGGGTGC TCGGGCCGGA CACCATGACC
ACCTTCGCCA ATGCCAGGAT GGCCGAAATG CTCGGCTATT CCGGCGAGGA GATGATCGGC
CGGCCGTTGA CCGACTTCAT GTTCGAGGAG GATGCGCCCG ATCATCTGAG AAAAATGGAG
AATCATCGCC AGGGCCTCTC GGAGAATTAC GAACGCCGAT TCCGCCGTAA AAACGGAGAG
ACGGTATGGA CTCTTACTTC TGCTACCCCT ATCTTTGATG ATGAGCATCA TTTCCAGGGC
TCCTTTGCGA TGTTCACCGA CATCTCCGAA AAGAAGCTGG CAGAGGAAGA GCTTCGCAGG
CTCAAGGACG AACTCGAACA GCGGGTGCAG GAGCGGACCG CCGAGCTTGC GGCCAAGAAC
GCTGAACTGG AGCGGATGAA CCGGATCTTC GTCGGCCGGG AGCTGCGGAT GGTGGAGCTG
AAGGAAAGGA TCGGGGAGCT GGAGAAAAAG CTCGGAGAGA GAACGCCATA G
 
Protein sequence
MFRRLSIRNK LAFALWGAAL LAFAVASAAL AFFASLTLER RARQIMEPYA QLVSVGAEAA 
VAFEDPGRAR EILNTLRANP QISEAEIVLG DGRLLARYSS RSNATFRHHP LKPDGVYLNH
NTAELVQSLQ DGAHLYLAMS LDELNRQTRN VLLVFAAGVV VLLVSITLGL LAALQRTIVR
PISTLAETVE QVRIRGDYHQ RVPASGADEV ARLGQSFNAM MGAIQEQDND LRRLTLFQRT
LLDSAAHGII STAPDGVVSS FNPAAERLLG YTADEVVDKQ TPACWHDPEE MARRALQLSE
ELGETISPGF DVFAARPRRN LPEENEWTFI RKDGACVPVH LSVTALRGES DRIRGFVGLT
YDLTERKRAE EERRESEAKY RRIVDTAIEG IWVLGPDTMT TFANARMAEM LGYSGEEMIG
RPLTDFMFEE DAPDHLRKME NHRQGLSENY ERRFRRKNGE TVWTLTSATP IFDDEHHFQG
SFAMFTDISE KKLAEEELRR LKDELEQRVQ ERTAELAAKN AELERMNRIF VGRELRMVEL
KERIGELEKK LGERTP