Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_0781 |
Symbol | |
ID | 5163571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 934808 |
End bp | 935989 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640548279 |
Product | Sel1 domain-containing protein |
Protein accession | YP_001229562 |
Protein GI | 148262856 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000364549 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACATT GCACTACGGT TATTCTCTTC GTCTCCATCT TGGTTGCATC GGTTATTTCC GGCAGGGCGG ATTTCGATGC GGGCTTAAGG GCATACAATG ATGGCGACTA CGCCACGGCC ATGCGTGAAT ACAGGATAGA CGGCAGCGCC AGGTCCCTTT TCAACCTCGG TCTCATGTAT GCGGAAGGGA AGGGGGTCAA TAAACGGAAC AGCAGGGAGG CGATGAAATG GTATCGCAAG GCTGCTGAAC AGGGTTTGGC GAAGGCTCAG TTTGCCCTTG GCCTCATGTA CGCTCTCGGC GAGGATGTTG CCGCGGACAA GAAGGAGGCG GCCAGATGGT ATCGTAAAGC CGCCGAACAA GGACATGCGG CCGCCCAGTA CAATCTGGCG CAGATGTATG CCAGGGGTGA TGGCGTCAAG AAGGACGAAA CGGAGGCCGA TAAATGGTAT CGCAAGGCTG CCGAGCAGGG GAATGCCGCA GCCCAGTTGA ACCTGGCTCA GCTATACGAA AAAGGCGCAG GCGTTGTGCA GGATAAAAAA GAGGCGGCCC GCTGGTATCT CAAGGCCGCC GAACAGGGGA ACGTGCGCGC CCAGTTCAGC ATCGCCATGA TGTACGACAA GGGGGACGGG GTCGAGCAAA ACAAAAAGGA GGCTGCGCGA TGGTTCCGCC GGGCCGCTGA GCAGAATCAT GCCAAGGCCC AGTTCAAGAT CGGCTTCCTG TACGATAAGG GTGATGGTGT CCTTCAGGAC AAGAAAGAGG CGGTGAAATG GTATCGTAAG GCTGCCGAGC GGGGGGTGTC GGAAGCCCGG TTCAATCTCG GACTCATGTA TTACGCCGGA TCGGGTGTGC CGCAGGACAA GAAGGCGGCT GCACGGTGGT TCCGCAAGGC CGCCGACCAA GGGGATGTTG ATGCCCAGTT CAACCTGGGG CACATGTACG ACCAGGGGGA CGGGATCAAG CAGGACAGGA AAGAGGCGGT GAAATGGTAT CGCAAGGCCG CCGAACAGGG CTTCGATCAG GCCCAGTTCA ATCTCGGTCT CATGTATTTC CATGGCTACG GCGTGAAACA GAACCGTAAG GAAGCCTTCA AATGGTTTGT AAAAGCGGCT GAACAGGGCT CGGATGAAGC AGTGAAAACC TTGGAAGTCC TCGGCAGGGG GATGCCTCTG GCCCGGCCTT AA
|
Protein sequence | MKHCTTVILF VSILVASVIS GRADFDAGLR AYNDGDYATA MREYRIDGSA RSLFNLGLMY AEGKGVNKRN SREAMKWYRK AAEQGLAKAQ FALGLMYALG EDVAADKKEA ARWYRKAAEQ GHAAAQYNLA QMYARGDGVK KDETEADKWY RKAAEQGNAA AQLNLAQLYE KGAGVVQDKK EAARWYLKAA EQGNVRAQFS IAMMYDKGDG VEQNKKEAAR WFRRAAEQNH AKAQFKIGFL YDKGDGVLQD KKEAVKWYRK AAERGVSEAR FNLGLMYYAG SGVPQDKKAA ARWFRKAADQ GDVDAQFNLG HMYDQGDGIK QDRKEAVKWY RKAAEQGFDQ AQFNLGLMYF HGYGVKQNRK EAFKWFVKAA EQGSDEAVKT LEVLGRGMPL ARP
|
| |