Gene Information |
Locus tag | Gura_2580 |
Symbol | |
ID | 5163616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 2984560 |
End bp | 2986122 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640550077 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001231331 |
Protein GI | 148264625 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
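The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2984560
end_bp = 2986122

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1563 520
```

Both results match the Gene Length (1563 bp) and Protein Length (520 aa) fields. The strand field does not affect the length, only the direction in which the interval is read.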
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAG GCAATATGAC GGTACACGAT CTTATGGATA TTCTGAAGCG GAGGAAGTGG AGCTTGCTCC TGCCGGCGGC AGCGCTCTTT CTTGTTGCAG TGAGCGTGGC ATTTATCCTG CCGCCGATCT ATCGCTCCAC CACGACCATT CTGATCGAGG AGCAGGAGAT CCCTCCCGAG ATGGTTGCCA CCACGGTGAC GAGCTTTGCC GAACAGCGGC TGCAGGTGCT CAACCAGCGC ATCATGAGTT CCACGAGGCT TTTGGAGATC ATCAATCGTT TCAATCTCTA CGCGGACATC AAGGACAAAA TAACGACGGA AGAGATGATC GAGAAGATGC GCAAGGACAT CAAGTTCGAT ACGATCAGCG CCGATGTCAT AGATCGCCGT ACCGGCCGGG CGACTCAGGC CACCATCGCT TTTTCCCTGT CTTACTCAGC CAGGAACCCT GCGACTGCCC AGCAGATCGC CAACGTCCTG GCTTCCCTTT ATCTGGAAGA GAACCTGAAG GTGCGTGAGC AATCCACGTC GGGGACTTCG AAGTTTCTTG AGGACGAGAT GAAGGACGTG CAGGCGAAGC TGGTCGGGTT TGAAGCGCAG ATTTCCGCTT ATAAGCAGCG AAACCTGAAT TCCCTGCCGG AACTGGTTCA AACCAACCTG TCGGAGCTGG ACCAGGTGGA GCGTAGCATT ATCCAGTTCA ATGACCAGTT GCGCACCCTG AAGGAGAAGG AAGGTTACCT GCGGAGCCAG CTTGCGAACA TCACGCCCGA AGACGAGAAT CAGGACAAGA CCCGCCTCAA TGATCTGAAA GCGAAACTGG TGAACCTGAA GAGCCGCTTC TCCGATGAGT ACCCCGATGT AAAAAAACTT CAGCAGGAGA TTGCGACTCT GGAAAAGCAG CTCCACACAG TCGGCGGAGA TGTAAAGTCT ATCCGTGCCG ATAATCCGAA CTATATTAAT CTGGCTTCCC AACTGGCCGC CGCCCAGTCG GAAATCGACT CGGTGAAACG CCAGCTTGCA CAGTTTCACG ACAAGCGTGA TTCTTACCGC AAACGGATTC AGGCTGCGCC GAAGGTTGAG GAAGGGTTTA AAAACCTGAT GGTCGAGCGA AACAACATGC AGTTGAAATA CGATGATCTT TCGAAAAAAT TTCTGGAAGC CAAGGTCGCC CACGGCCTGG AGAAAGAGCA GATGGGCGAA CGGTTCACCA TCGTCGATGC GGCCAGGCTA CCTGAAAAGC CGGTGAGTCC CAATGTGCCG GTTATCATGC TGATCGGCCT GATTCTCGGG ATCGGCAGCG GGGTAGGCGT TGCGACCATT CGCGAAACCG GCGACAAATC AGTGCACAGC ATGGAGGTCT TGGCCAAGGC AACCATGTAT CCCGTGCTTG CCGCCATTCC TGAAATCGTC ACCTGGCAGG ATCAGCAACG GCAGCTGAGA AGACGCAGAT CGCTTCTTGT TGCGGGCATA ATGATCATTC CCATTTCCCT GCTGGCAATT CATTTTCTGG TCATGGACCT GAGTGTGGCC TGGGCCATTT TCAAGCGCAG AATGGCTCTT TGA
|
Protein sequence | MTTGNMTVHD LMDILKRRKW SLLLPAAALF LVAVSVAFIL PPIYRSTTTI LIEEQEIPPE MVATTVTSFA EQRLQVLNQR IMSSTRLLEI INRFNLYADI KDKITTEEMI EKMRKDIKFD TISADVIDRR TGRATQATIA FSLSYSARNP ATAQQIANVL ASLYLEENLK VREQSTSGTS KFLEDEMKDV QAKLVGFEAQ ISAYKQRNLN SLPELVQTNL SELDQVERSI IQFNDQLRTL KEKEGYLRSQ LANITPEDEN QDKTRLNDLK AKLVNLKSRF SDEYPDVKKL QQEIATLEKQ LHTVGGDVKS IRADNPNYIN LASQLAAAQS EIDSVKRQLA QFHDKRDSYR KRIQAAPKVE EGFKNLMVER NNMQLKYDDL SKKFLEAKVA HGLEKEQMGE RFTIVDAARL PEKPVSPNVP VIMLIGLILG IGSGVGVATI RETGDKSVHS MEVLAKATMY PVLAAIPEIV TWQDQQRQLR RRRSLLVAGI MIIPISLLAI HFLVMDLSVA WAIFKRRMAL
|
| |
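The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal translation sketch. It assumes the gene sequence is listed 5' to 3' in coding orientation, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two tables differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool. Only the first 30 and last 12 nucleotides of the record are used here.

```python
# Compact standard codon table (base order T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-character blocks in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-character blocks of the gene sequence in this record.
gene_prefix = "ATGACAACAG GCAATATGAC GGTACACGAT"
# Last 12 nucleotides of the gene sequence, ending in the TGA stop codon.
gene_tail = "ATGGCTCTTTGA"

print(translate(gene_prefix))  # MTTGNMTVHD
print(translate(gene_tail))    # MAL
```

The two outputs agree with the first ten and last three residues of the protein sequence listed above.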