Gene Information |
Locus tag | GSU0617 |
Symbol | |
ID | 2685411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 652996 |
End bp | 654081 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637125284 |
Product | NHL repeat-containing protein |
Protein accession | NP_951675 |
Protein GI | 39995724 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0567002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTTC TGAAGCTCAT CCGCAGGTTC GTCGGTGCGG CCGCCCTGTG CGCCCTGTGC GCCGGTTGCG CAGGACAACA GGTCCGGGAG GAGCGGCGCT ACTTCTGGCC GCCGCTGCCC GAGCGTCCCA GGATCGAATG GCTCGGTGCC TACAGCAGCC AGAACGACTT CCCGAAGCAG GGATTCGCGT CGTTCATGGC AGCCATTGCC GGAGAAGAAC AGGCCATGAG CCTGACCAAG CCGCTGGATG TCTATGCGGA TGGCCAGGAC CGGATTTATG TGGCAGATCC GGGACTTCGC GGCGTGGTTG TGTTCAATAT GAAAGAGCGG AGCGTGTCGA TGCTCGGCGG ACCCCAGGCG GCTAACCAGT TTAATACCCC GGTTTCGGTC ACCGGTGATT CCCAGGGGAA TATTTATGTT TCCGATGCGG AAAAGGGTGG GATACTGATT TTTGACAGAT TTGAGGTGCC GCGTCGTTTT ATCGACACCA AAGCTGCTGT CAAGAGAAAC ACTGACATCG CCGTGGATGA AAAGGGTCAG AGAATTCTCG TGGTGGATGC GCGCGAGCAC CGGATTGCCA TCCTCGACAT GCAGGGGGGG CTGCTTTCCG CCTTTGGGAA GCGTGGCATC GAAGACGGCG AATTCAACTT CCCCGTGGCG GTGGCCATCA ATCACAAGGG GGAGATTATC GTGGGCGATG CCATGAACGC CCGGGTTCAG ATCTTCGATC AGGACGGGAA GTTTCTCCGC AAGTTCGGGC GCCGTGGCGA CGGGCCGGCT GATTTCCAGA TCATGAAAGG CGTGGCCGTC GACTCGGAGG ATCATATCTA TGTGACCGAG GGAAAAGGTC ACAAGCTCAT TATCTTCGGC ACCAACGGCG AGTATCTTCT CACCGTGGGC GGACTCTACT CCGCCATCAC CACCGGCAAG CAGGCCCCCG GCGGATTCGT CATCCCGCAA GGAGTCTTTA TTGACGATAA GGACGTCATT TACGTGGTTG ACCAACTCAA TCGGCGGTTC CAGGTGTTCC AGTACATCTC CGACGATTTC CTCAAGCGCA ACCCCATCCC GGGATGGCAG GAGTAA
Protein sequence | MNVLKLIRRF VGAAALCALC AGCAGQQVRE ERRYFWPPLP ERPRIEWLGA YSSQNDFPKQ GFASFMAAIA GEEQAMSLTK PLDVYADGQD RIYVADPGLR GVVVFNMKER SVSMLGGPQA ANQFNTPVSV TGDSQGNIYV SDAEKGGILI FDRFEVPRRF IDTKAAVKRN TDIAVDEKGQ RILVVDAREH RIAILDMQGG LLSAFGKRGI EDGEFNFPVA VAINHKGEII VGDAMNARVQ IFDQDGKFLR KFGRRGDGPA DFQIMKGVAV DSEDHIYVTE GKGHKLIIFG TNGEYLLTVG GLYSAITTGK QAPGGFVIPQ GVFIDDKDVI YVVDQLNRRF QVFQYISDDF LKRNPIPGWQ E
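The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence listed above ends in TAA). It also shows one way to strip the 10-character display blocks before working with a sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def join_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence (display blocks allowed)."""
    s = join_blocks(seq).upper()
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# Values taken from the Gene Information table above.
print(gene_length(652996, 654081))  # 1086
print(protein_length(1086))         # 361
```

Under these assumptions the computed values agree with the Gene Length (1086 bp) and Protein Length (361 aa) fields. Passing the full gene sequence to gc_percent gives a figure to compare against the listed GC content.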