Gene Information |
Locus tag | GSU0989 |
Symbol | |
ID | 2685663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 1065932 |
End bp | 1067824 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637125659 |
Product | NHL repeat-containing protein |
Protein accession | NP_952043 |
Protein GI | 39996092 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02242] phage tail protein domain |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCACG GCCTCGACCG CCGTCTGGGG CTCGCGGGAC GCATCCGCCC CGACGCCCTG GTGGCTGATC GCTGCGGAAC GCTGATCATG CTGGCCGGCG AGCGCTTCTA TCGTCTGGAC CCCATGTCCG GCCGACTGGA GCGGATACCC TGTCTCGGCG GCAGGGGGGA CAGGGCGGGG GAACTGAGTG GACCCCGGGC CATGGCTCTG GGGAGCCGCA ATCTCTACGT GGCCGATACG GACAACAACC GGGTCTGCGT CTTTGCCACG GTTAACTGGC AGGTGCGCCG GTTTATCGGA GCCGAGAACC CGGCAGGGGA ACCGGCGGCC GGAACCGGTC CGGGGGAGTT CGACCGCCCC CTGGATCTGG CGGTGGACCC CTGCGACAAC CTCTATGTGC TCGATGCCGG CAACCGGCGC ATCCAGCGTT TCGATTACCA TGGCGAACCC GTGCCCCATG TGCCCCCCTT CGGCGCCGAC CGGCTGAAGC AGCCGGTGGC GCTGGCGCTG GGCCCGGCTC CCTCCCCGTC CGGCGGGGGG GCCCTGGTCC ACTGCCTCGA TACGGGGCTC ACCGCCATCG TTACCTTTGA CGACCAGGGC CGGTTCCTGG GCACCGTCGG CCTGGACGAC CTCGGCTTCG AGCCTGCCGG TCTGGCGGTG GACGCCGACG GCAAATGGTA CGTCTCTGAT CGGGAGCGGT TCATCTATGC CATCAGGTCG GCAGGTGACT GGAGCCCTCT GGAGGAGTAC GAGGGGAAGG CCCTGCGGCT GTTCGCCGGT CCCGGCGGGG AGTTTTACGC CCTGGAAGAT GGGGAGGTGG CCCGCCTCAC CCGTCGCCGT CGCTATCCGC CGGCAGGCTC GTGGACCGGC AGCGGCCCGG TAACGGGCAT CTACACTTCC CGCTCATTCG ATACCGGCGA CGGCCGCCTC TTCTGGCACC GGGTGACGCT GGACGCGACG GTCCCCCCAA AGACCCAGGT GCGACTTTCC TACTTCATCT ACGAAACCGG CCGCGATCCG GAGCTCCTGC CGGCCGACGG CGAGTGGCGA AGTTTCCCGT CCAATCCGGC CGACGCCCTT TTTGAGCGGA AGGAGGGACG GTATCTGAGG GTGCGCCTGG AACTGATCTC CGAAGACCGG CATGCCACCC CCACAGTGGT GAGCCTCCGC CTCCAGTTTC CCAAACAATC GTATCTCCGC TACCTTCCGG CCGTCTTCCA GGACGACGAG CGGGGGCGTG ATTTCCTGGA GCGGTTTCTT TCCCTGTTCG AGAGCGTCCT TTACGACCTG GAGCGCGAAA TATTCACGAC CCGGCGCTAT GCCGACCCCT GTGCCGTACC GGCCGGCTTC CTTCCCTGGC TCGCCTCGTG GCTGGCGCTC CCCGATGCGG ACCAGTGGCT CGAAGATGGC GGGGCACGGC TGCGCACACT GATCGCCCGG GCCAATGAGC TGTACCGTTG TCGGGGCACC CGGGGTGGAC TGGCGGAACT CATCACTCTT TACACCGGCA AGGAACCGTG GATTGTGGAG GCGTTCCAGT TGGACCGCAT CCGGGGCCGG AGCGAGTGGC GCGAAACCAT GAGGCTCTTC GGGGAAGATC CCTACCACTT CACGGTTCTC CTGCCGCCGG GCGGCGCAGG GAGGACCGAA ACCGTAAAGC GGATCGTGGC GCGCGAGCGC CCGGCCCACA CCTGTGCCAC GGTAGTTGCC CTGGAGAACC TGTTCCGGCT TGGGGGGCAC ACCTACCTGG AAGTCAATAC CAACCTGAAC CAGCCCCTCT TTGCCCTGGA GACCTCCTCG 
TCGCTCGCCC GGCAGACCTA TCTGGCCGAC GGCGAGAAGG CGGGGCAGGC GCAGGTCCGG GCGCGACAGG GCATGGATAC ATTGTTTGAG TGA
|
Protein sequence | MVHGLDRRLG LAGRIRPDAL VADRCGTLIM LAGERFYRLD PMSGRLERIP CLGGRGDRAG ELSGPRAMAL GSRNLYVADT DNNRVCVFAT VNWQVRRFIG AENPAGEPAA GTGPGEFDRP LDLAVDPCDN LYVLDAGNRR IQRFDYHGEP VPHVPPFGAD RLKQPVALAL GPAPSPSGGG ALVHCLDTGL TAIVTFDDQG RFLGTVGLDD LGFEPAGLAV DADGKWYVSD RERFIYAIRS AGDWSPLEEY EGKALRLFAG PGGEFYALED GEVARLTRRR RYPPAGSWTG SGPVTGIYTS RSFDTGDGRL FWHRVTLDAT VPPKTQVRLS YFIYETGRDP ELLPADGEWR SFPSNPADAL FERKEGRYLR VRLELISEDR HATPTVVSLR LQFPKQSYLR YLPAVFQDDE RGRDFLERFL SLFESVLYDL EREIFTTRRY ADPCAVPAGF LPWLASWLAL PDADQWLEDG GARLRTLIAR ANELYRCRGT RGGLAELITL YTGKEPWIVE AFQLDRIRGR SEWRETMRLF GEDPYHFTVL LPPGGAGRTE TVKRIVARER PAHTCATVVA LENLFRLGGH TYLEVNTNLN QPLFALETSS SLARQTYLAD GEKAGQAQVR ARQGMDTLFE
|
| |
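The coordinate and length fields in the Gene Information table are related by simple arithmetic, which can be used as a sanity check on the record. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon (the gene sequence above ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not contribute a residue.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for locus tag GSU0989.
start_bp, end_bp = 1065932, 1067824
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1893
print(protein_length(length_bp))  # 630
```

Under these assumptions both results agree with the Gene Length (1893 bp) and Protein Length (630 aa) fields of the record.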