Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_1741 |
Symbol | |
ID | 5164911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 2023099 |
End bp | 2024274 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640549235 |
Product | hypothetical protein |
Protein accession | YP_001230507 |
Protein GI | 148263801 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0012648 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA TGGCTACACC GACGCAACCG ATCACCCAGA TCCCATCGAC GGCGGTTCCC ACCGACCTGC AAAATATCGG CGCCATGCTG AACAACATGG GGACTGTCCT GAACAAGGGG GCAAATATGA CCATTGCCGA TTTGACCCCT TTCTTTGTTG CCGACCCCGG CTTCGGCATC AATGGCGGAA TGACCGCATT CCAAGTAATG ACCCTTCTCC AGCTTAGCGT ACCCACATTA ATTGCCCAGG GCACCATTAC GCAAATGTCG AATGTGACCT TTGTCGGCAA TTCCCCGGTC GGAGGCTACA AGATCTCATT CTTTGTGCGC TTCAGTGACG GCTCAATGAC CATGTCCAAT ATGACCTTTT CCGACGAGAT GGTTGTGGCG AAAAACGCCT CCAATGCCTG GCAGTTCAAG GGGAACGGCC ATCGGTCATT ATTGTTCAAC AATATGCTGA CACAGCAGTG GCAGATTACC GCGACCACCT CGCAAACGGA AGCCGGCCTG ATTTTTGACA TGATGGATGT GGAGAATGCC TTCAAATCCG CCGTGGCAAC CGGACCGGGT TTACCCTCAC CAGGAGGCGT CATGTTCATC AAAGAACCCG GCAATCCGAT ACTATTCGTG ATGTCTGCGT CAAACAACTC GTTACCCACC ATGGAATCGA CATTTTTCAC GATGCCTGAT GCAACGATAT CCTCCCTACC GGACAATGCT CCGGTTATAT TCTCTTTTTA CAGTTCGTTG CCCCCACGCA ACAACCCGAT GGAACAACGA ACCATGATAT ACCCGAAGCG CTGCTTAACC AGGGCAGAGG CTGCGGGCAC AAGCGGAGTT TTCCCCATCG TGACCCCCGC GGGGAACTTG AATACCCACT CGTTCTCAAC GATGATGAAT AACATGATGG GCGGCATGAT GGGCGGGATG ATGTCCATGA ACTTTACCTA CACCACGCCC ACGGCCTTGC CTTTTGCAAT GATGACGGCG GATTTCAACA TATCGAGCAG TACTTTCAGC AACGAGGCAA TCCAGACCCT CCCCCTGAAC AGGACTTCCA TGACCATGCG GATGCAGGGA CCAACCACTG CTCCAACGAC AGGCACTGGC ACCTTAAACA TAACTGCAAC GGATATTTTC GGAAGGAATG TGGGAACGGC TTGGATGTTC CAGTAG
|
Protein sequence | MSQMATPTQP ITQIPSTAVP TDLQNIGAML NNMGTVLNKG ANMTIADLTP FFVADPGFGI NGGMTAFQVM TLLQLSVPTL IAQGTITQMS NVTFVGNSPV GGYKISFFVR FSDGSMTMSN MTFSDEMVVA KNASNAWQFK GNGHRSLLFN NMLTQQWQIT ATTSQTEAGL IFDMMDVENA FKSAVATGPG LPSPGGVMFI KEPGNPILFV MSASNNSLPT MESTFFTMPD ATISSLPDNA PVIFSFYSSL PPRNNPMEQR TMIYPKRCLT RAEAAGTSGV FPIVTPAGNL NTHSFSTMMN NMMGGMMGGM MSMNFTYTTP TALPFAMMTA DFNISSSTFS NEAIQTLPLN RTSMTMRMQG PTTAPTTGTG TLNITATDIF GRNVGTAWMF Q
|
| |