Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU3226 |
Symbol | |
ID | 2687686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 3538273 |
End bp | 3540003 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637127919 |
Product | cytochrome c family protein |
Protein accession | NP_954267 |
Protein GI | 39998316 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCAC GACCCATGAT AACGGCACTG GTCTTCTTCC TGGGAACGGC GGCCCCGGCG GCGGCCCTCT CTCCCTCCCT GAGCGTGACC GATTCTCACA ACCGCCACAA CCTGTCCGTA AGCGGCCCGG GGCTCACCGG TGCGACCCGC GCCTCCACCG AAGAGCGGGT CTGCATTTTC TGCCATACGC CCCACCACGC CACGAGCATA ACCCCCCTCT GGAGCCGCGA GTTGTCCAGC GCGCTCTACA CCCCCTACGA CTCGTCGACC CTCAAGGCAA TCCAGAAACC GGGCCAGCCT ACCGGCGCAT CCCGTCTTTG CCTGGGGTGT CATGACGGCA CCATCGCCCC CGGCATGCTC TCGGGGGGCA AGATGATCGC GGGGCTGCCG ACCCTGCCCA CGAATGTGCG CTCGAACCTG TCCACTGATC TCTCCGACGA CCATCCCATT TCCTTTCCCT ATGCCGGATC CATCTCGGTC GCCGCCGTGC TCCGGCTGCC GGCCGAACTG CCGCCCGAAA TCTCGCTCGA AGACGGCAGA ATCGAATGCA CCAGCTGCCA CGATCCCCAC AAGGACCGCT ACCCTCCGGC TGATCATCCG CAGAAATCGG GGAAATTCCT GGTGCTCGAC AACAACAATT ACTCGGCCCT CTGCACCGCC TGCCACAGCC GCGCCGGGCT CACGGAGGGC GCCCACTACC TGCCGGGCGA TCCCTGCGAA AGCTGTCACC AGGTCCACCG GGCCGATCAG CCGGCACGGC TGCTGAAGGG AGCCAATGTG CAGGAGACCT GCGTCCTCAC CTGCCACAAC GGAACCGGCA CGGTGGAAAC ATCCGGGACC GACATCCGCT CCGCATCGGT CTTCGGCAAA GCTCATCAGA CCGGCCGGCT CGTTCTTTCC GGCAGGCACG ACGCCGACGA GAACCCCCTC GACTTCGAGC CGTCACAGCC GCACGTGGAG TGCGTCGATT GCCACAATCC CTGCATCACC CGTCATGAAT CAACACCTCT TGCTTCGCCG CCGACGCTGA ACGGCCGGCT CGTAGGGGTT ACCATCGCCA AGTCGCCCGA CGGCATCAAG ACTTACGCCG ACAGCGAATA TGCCATCTGC TACAAATGCC ACGGCGACCG GAGTTTCGTG CCCCCGGCCG TGCCCCGCAG GGTCCAGACC GGTGACCAGA GCCTCCGCTT TGCCCAGGAA AACCCCTCGT ATCACCCGGT ATCGGCGCCG GGCAAAGGGA TGAGCGTGCC AAGCCTGCGC TTTGAAATCG TCGGGTTCAG GCCCGCTCGG ACCCTCTCCA TCTCAAGCCT CATCTACTGC ACCGATTGCC ATTCGAGCAA CAAGGGCAGC AAGGTGGGGG GGAGCGGCGC ATCCGGACCC CACGGCTCCG ATTACCAGCC AATCCTGATG GACCGCTACG AGCACGACAC CTATCCCCTC GCCTATGCCG AGAGCAACTA CTCCCTCTGC TTCCGTTGCC ATGACCAGAC GATTCTGCTG GACCCGGGCC GATCGGCGTT TCCGCTCCAC CAGTCCCACC TGGTCAACCA TCAGGTACCC TGCTCAGTCT GCCACGACCC CCATGGCGTT CCCCTGGTTC TGGGGGGCAC CACGGCTGCA AACAGCCACC TGATCAACTT CGATACCCGC TTCGTCACCA CCGGCAGCTA TGACTCTCCG GGTCGCAGCT GCACCGTGAG TTGCCACTCG GCCAACCCGA GAACCTACTG A
|
Protein sequence | MAARPMITAL VFFLGTAAPA AALSPSLSVT DSHNRHNLSV SGPGLTGATR ASTEERVCIF CHTPHHATSI TPLWSRELSS ALYTPYDSST LKAIQKPGQP TGASRLCLGC HDGTIAPGML SGGKMIAGLP TLPTNVRSNL STDLSDDHPI SFPYAGSISV AAVLRLPAEL PPEISLEDGR IECTSCHDPH KDRYPPADHP QKSGKFLVLD NNNYSALCTA CHSRAGLTEG AHYLPGDPCE SCHQVHRADQ PARLLKGANV QETCVLTCHN GTGTVETSGT DIRSASVFGK AHQTGRLVLS GRHDADENPL DFEPSQPHVE CVDCHNPCIT RHESTPLASP PTLNGRLVGV TIAKSPDGIK TYADSEYAIC YKCHGDRSFV PPAVPRRVQT GDQSLRFAQE NPSYHPVSAP GKGMSVPSLR FEIVGFRPAR TLSISSLIYC TDCHSSNKGS KVGGSGASGP HGSDYQPILM DRYEHDTYPL AYAESNYSLC FRCHDQTILL DPGRSAFPLH QSHLVNHQVP CSVCHDPHGV PLVLGGTTAA NSHLINFDTR FVTTGSYDSP GRSCTVSCHS ANPRTY
|
| |