Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU3225 |
Symbol | |
ID | 2687683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 3537164 |
End bp | 3538261 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637127918 |
Product | NHL repeat-containing protein |
Protein accession | NP_954266 |
Protein GI | 39998315 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCAAT CAGTCCCGGC ACGCGGCACA TCCCCTGCAC GCCGGCTTGC GACCGCGGCC AGGCACTGCC TGGTCCTGGC CGTGGCGAGC ATCCTGGCCG CCTGCACGAC CATTACGGCC GTAACACCGA ACGAACCGGA GCAACGGCTG GTCTGGCCGG GACCGCCGCT CCAGCCGCGC ATCGAGTGGG TGCGCGAAGT CTATAACCAG AAGGGGCTCG GGGTATCTCC CGGGTTCTGG GGCAGGATCG CGCGGTTCGT ACTGGGCGAG AAGGAGGAGC GATTCATTCG TCCCCACGGA ATTCTGGCCG ATGAACAGAT CTTCGCCTTG GTCGATTCGG GCGCCGGGCG AGTGCACCTG ATTGATCTGA AGCGAGGGAC CTATCGGCTG CTGCCGGAGG AAGGCAAGAC CCCGATGGTC TCACCCATCG GGATAGCCCG GGACAGCCGG GGAGCGATCT ATGTAACCGA CTCCGGCACC GGCCTGATCC ACCGCTTTTC CGACGACGGC GACTCTTTCG TCGCGCTGGA CCTCCGCCCG CTTCACCGCC CCACCGGCAT CGCCTTCAAC CCGGTAACGG GCCTGCTGTA CGTGGCGGAA ACCGGCGCCC ACCGGATCGT AGCCTTCGAT TCGGCGGGCA AGGAAACCCT GCGCATTGGC GGGAGCGGCA TGGAGCCCGG CGCCTTCAAC TTCCCCACCG ACCTGGCCGT GATGGCCGAT GGACGCCTTC TGGTAACCGA CTCCCTCAAC AGCCGGATTC AAATCTTCAC GGCAGACGGG AAGCCGGCGG GAAGCTTCGG CGAGGCCGGG GATACTCCCG GTCGCTTCAC CCGCCCCAAG GGGGTCGCAG TGGACAGCGA AGGGCATATC TACGTCTGCG ACAGCCAGCA GGACATGGTT CAGATCTTCG ACGAGACGGG CCGGCTGCTC CTGGCCTTCG GCGACAAGGG AAGCCTCCCC GGCCAGTTCT GGATGCCTTC CGGCATCCAT ATCGCCAATG ACATGATCTA TGTCTCCGAT ACGTATAACC AGCGGGTACA GGTCTTCCGC TACCTGAAGG AAGAGCCCTG GGGGCAGGAC CCCCACACCC CCGACTAA
|
Protein sequence | MEQSVPARGT SPARRLATAA RHCLVLAVAS ILAACTTITA VTPNEPEQRL VWPGPPLQPR IEWVREVYNQ KGLGVSPGFW GRIARFVLGE KEERFIRPHG ILADEQIFAL VDSGAGRVHL IDLKRGTYRL LPEEGKTPMV SPIGIARDSR GAIYVTDSGT GLIHRFSDDG DSFVALDLRP LHRPTGIAFN PVTGLLYVAE TGAHRIVAFD SAGKETLRIG GSGMEPGAFN FPTDLAVMAD GRLLVTDSLN SRIQIFTADG KPAGSFGEAG DTPGRFTRPK GVAVDSEGHI YVCDSQQDMV QIFDETGRLL LAFGDKGSLP GQFWMPSGIH IANDMIYVSD TYNQRVQVFR YLKEEPWGQD PHTPD
|
| |