Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU3222 |
Symbol | |
ID | 2688295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 3533249 |
End bp | 3534487 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637127915 |
Product | NHL repeat-containing protein |
Protein accession | NP_954263 |
Protein GI | 39998312 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCGC GTTTCGCCGT AGCCCACCGT CCTTTCACAC AAGCTGCCCG GCTGTTCCTG CTCTGTTCGC TGCTCTGGAT CAGCGGATGT GCCGGTAAAA CCGGCACTGC GGGAAAGACC TTTTTTCCGC CGCCTCCCAA CCTGCCCCGG CTCCAGTACC TGATGGGGAT TGCCAACTCG ACCGATGTGG AAGGGAAGGA TTCATCGTTT TCCCTCTTCG GGGGATTGGC CGAGCAACGC GAAAAGATCC GCTACATCGT GAAGCCGTAC GGCATTACCG AGGCCGGCGG CAAGCTCTAC GTGAGCGATG TGGGAACCGC CCAGATCGTC GTCATCGACC TGCCGGGCAA AAAATTCGAG CTGCTCAAGG GGGCTGCCGG ACCTGGCAAG CTGACCACTC CGGCCAACGT GGCCGTGGAC AAGGACGGCT TCATCTACGT GGCCGACGCG GGCCGGAGAG AGGTGGTGGT ATTCACGCCG GAAGGCGATT TCCTCAAGGC CATCGGCGGG GACCGGGACA TGAAGCCCGT GGATGTGGTC GTTAGCGGCG ACCGGGCCTT CGTGCTCGAC ATCAAAAGCA GCGATATCAA AGTGTTCAAC GTCAAGAGCG GCCAGTATCT CGAAAGTTTC GGCACAGCGG GCGGCCCCTT CGAGCGGCTC GCCATGCCCA TCAACCTGGC CATGGACTCC AAGGGGTTTC TCTATGCCAC CAACGGGGTC AGCGGCAGAG TCCTCAAATT CGACCGGGAC GGCAACCTGC TGCTCTCCTT CGGCCAGATG GGTGATGGCT TCGGCCAGTT CGCCCGCCCC AAGGGAATCG CGGTGGATCC CACCGGACTG ATCCACGTGG TTGACGGCGG ACATCAGAAC GTTCAGCTCT TTTCGGATAC GGGACGCCTG CTCCTCTTCT ACGGCGACGC GGGCAAGGAT AGCACCGCAT CACTGAATCT CCCGGCAGGG ATCGCCTATT CCACGGCCAA CCTGGAGTAC TTCCAGAAAA TGGCGGACTC TTCCTTCAAG CTGGACGGCG TGGTGTTCGT CACCAACCAG GGGGGGAAAG CGAACAAGGT GGCCGTGTAC GGGTACGGCA AACGCGAAGG GATCGATTAC GAGCAGGAGT ACGAGAAAAT CCGCAAGGAA CTGGAAGAGC GTGCACGGAA AGCCCGCGAA AAGGAGGCCC AGGAGGGGAA GAAGGCTGGC CAGGCCGAAC CCAAGGCTGC GGAACCCGCC GCGAAGTAG
|
Protein sequence | MKPRFAVAHR PFTQAARLFL LCSLLWISGC AGKTGTAGKT FFPPPPNLPR LQYLMGIANS TDVEGKDSSF SLFGGLAEQR EKIRYIVKPY GITEAGGKLY VSDVGTAQIV VIDLPGKKFE LLKGAAGPGK LTTPANVAVD KDGFIYVADA GRREVVVFTP EGDFLKAIGG DRDMKPVDVV VSGDRAFVLD IKSSDIKVFN VKSGQYLESF GTAGGPFERL AMPINLAMDS KGFLYATNGV SGRVLKFDRD GNLLLSFGQM GDGFGQFARP KGIAVDPTGL IHVVDGGHQN VQLFSDTGRL LLFYGDAGKD STASLNLPAG IAYSTANLEY FQKMADSSFK LDGVVFVTNQ GGKANKVAVY GYGKREGIDY EQEYEKIRKE LEERARKARE KEAQEGKKAG QAEPKAAEPA AK
|
| |