Gene Gura_1741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1741 
Symbol 
ID5164911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2023099 
End bp2024274 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content53% 
IMG OID640549235 
Producthypothetical protein 
Protein accessionYP_001230507 
Protein GI148263801 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0012648 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA TGGCTACACC GACGCAACCG ATCACCCAGA TCCCATCGAC GGCGGTTCCC 
ACCGACCTGC AAAATATCGG CGCCATGCTG AACAACATGG GGACTGTCCT GAACAAGGGG
GCAAATATGA CCATTGCCGA TTTGACCCCT TTCTTTGTTG CCGACCCCGG CTTCGGCATC
AATGGCGGAA TGACCGCATT CCAAGTAATG ACCCTTCTCC AGCTTAGCGT ACCCACATTA
ATTGCCCAGG GCACCATTAC GCAAATGTCG AATGTGACCT TTGTCGGCAA TTCCCCGGTC
GGAGGCTACA AGATCTCATT CTTTGTGCGC TTCAGTGACG GCTCAATGAC CATGTCCAAT
ATGACCTTTT CCGACGAGAT GGTTGTGGCG AAAAACGCCT CCAATGCCTG GCAGTTCAAG
GGGAACGGCC ATCGGTCATT ATTGTTCAAC AATATGCTGA CACAGCAGTG GCAGATTACC
GCGACCACCT CGCAAACGGA AGCCGGCCTG ATTTTTGACA TGATGGATGT GGAGAATGCC
TTCAAATCCG CCGTGGCAAC CGGACCGGGT TTACCCTCAC CAGGAGGCGT CATGTTCATC
AAAGAACCCG GCAATCCGAT ACTATTCGTG ATGTCTGCGT CAAACAACTC GTTACCCACC
ATGGAATCGA CATTTTTCAC GATGCCTGAT GCAACGATAT CCTCCCTACC GGACAATGCT
CCGGTTATAT TCTCTTTTTA CAGTTCGTTG CCCCCACGCA ACAACCCGAT GGAACAACGA
ACCATGATAT ACCCGAAGCG CTGCTTAACC AGGGCAGAGG CTGCGGGCAC AAGCGGAGTT
TTCCCCATCG TGACCCCCGC GGGGAACTTG AATACCCACT CGTTCTCAAC GATGATGAAT
AACATGATGG GCGGCATGAT GGGCGGGATG ATGTCCATGA ACTTTACCTA CACCACGCCC
ACGGCCTTGC CTTTTGCAAT GATGACGGCG GATTTCAACA TATCGAGCAG TACTTTCAGC
AACGAGGCAA TCCAGACCCT CCCCCTGAAC AGGACTTCCA TGACCATGCG GATGCAGGGA
CCAACCACTG CTCCAACGAC AGGCACTGGC ACCTTAAACA TAACTGCAAC GGATATTTTC
GGAAGGAATG TGGGAACGGC TTGGATGTTC CAGTAG
 
Protein sequence
MSQMATPTQP ITQIPSTAVP TDLQNIGAML NNMGTVLNKG ANMTIADLTP FFVADPGFGI 
NGGMTAFQVM TLLQLSVPTL IAQGTITQMS NVTFVGNSPV GGYKISFFVR FSDGSMTMSN
MTFSDEMVVA KNASNAWQFK GNGHRSLLFN NMLTQQWQIT ATTSQTEAGL IFDMMDVENA
FKSAVATGPG LPSPGGVMFI KEPGNPILFV MSASNNSLPT MESTFFTMPD ATISSLPDNA
PVIFSFYSSL PPRNNPMEQR TMIYPKRCLT RAEAAGTSGV FPIVTPAGNL NTHSFSTMMN
NMMGGMMGGM MSMNFTYTTP TALPFAMMTA DFNISSSTFS NEAIQTLPLN RTSMTMRMQG
PTTAPTTGTG TLNITATDIF GRNVGTAWMF Q