Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_1672 |
Symbol | |
ID | 5164174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 1941222 |
End bp | 1942262 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640549168 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001230440 |
Protein GI | 148263734 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTCGA TGTTCACCGA CAAGACCCTC CTCATCACCG GCGGGACCGG TACCTTCGGC AATGCCGTTC TGCGCCGCTT TTTGGAGACG GAAATCGGCG AGATCCGTGT TTTCTCCCGC GACGAGAAGA AGCAGGACGA TATGCGCAAG CACTTCACCC ATCCGAAGCT ACGCTTTTAC ATCGGTGATG TGCGCACTGA ATCAAGTCTG CGCGGCGCCA TACAGGGGGT GGATTTCGTC TTTCACGCTG CTGCACTCAA GCAGGTGCCG TCGTGCGAAT TCTTTCCCCT CGAAGCGTTG CGTACCAACG CCCTGGGTAC GGAGAACGTC CTCAATGCCG CCATCGGTGC GGGCGTACGG CGGGTGATCG TACTCAGTAC CGACAAGGCT GTCTATCCGA TCAACGCCAT GGGGATCTCC AAAGCGATGA TGGAGAAGCT CGCTGTCGCC AAGTCCCGCG TCGCCGCACC TCAAACGACG ATCTGCTGCA CCCGCTACGG TAACGTCATG GCGAGTCGTG GTTCGGTAAT TCCCCTTTTT CTGAAGCAGA TTCAGGATGG GAAAGGGATC ACTATCACAG ACCCGAAGAT GACTCGCTTC ATGATGACGA TTGAGGATGC GGTTGACTTG GTCCTCTATG CATTTGAACA CGGCTCTGCC GGTGACACCT TCGTGCAGAA GGCCCCTTCC TGCACCATCG GTGCCCTCGC CGAGGCCCTG AAAAAGCTTA CCGGCAGCAG AGTGCCGATT GAGATCATCG GCACGCGCCA TGGCGAAAAG TTGTATGAAA CCCTCCTTAC CCGCGAAGAG ATGGCAGTCG CCGAAGATCA GGGAGGATAT TACCGGGTTC CGGCGGACAA CCGCGATCTC AACTATGGCA AGTACTTCAC TGAGGGTAAA GAGCGGATCA CCGAGCAGAC CGACTACAAT TCGCACAACG TTCCGCTCTT AGATGCCGAC GGCATGGCCG AGCTGCTCAA GAAACTCCCT TGTGTCCAAC AGGCCCTGCG CGGGGAAAAA GTTGCCAGAG GAAAAGAATG A
|
Protein sequence | MTSMFTDKTL LITGGTGTFG NAVLRRFLET EIGEIRVFSR DEKKQDDMRK HFTHPKLRFY IGDVRTESSL RGAIQGVDFV FHAAALKQVP SCEFFPLEAL RTNALGTENV LNAAIGAGVR RVIVLSTDKA VYPINAMGIS KAMMEKLAVA KSRVAAPQTT ICCTRYGNVM ASRGSVIPLF LKQIQDGKGI TITDPKMTRF MMTIEDAVDL VLYAFEHGSA GDTFVQKAPS CTIGALAEAL KKLTGSRVPI EIIGTRHGEK LYETLLTREE MAVAEDQGGY YRVPADNRDL NYGKYFTEGK ERITEQTDYN SHNVPLLDAD GMAELLKKLP CVQQALRGEK VARGKE
|
| |