Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_4379 |
Symbol | |
ID | 5166212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 5074417 |
End bp | 5077305 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640551861 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001233095 |
Protein GI | 148266389 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0121684 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACCG ACAAGATAAT TATCAAGGGT GCATGCGAAC ATAACCTGAA GTGCATCGAC GTGGAGATAC CCCGTGACCA GCTTGTGGTG ATCACCGGCA TCTCCGGTTC CGGCAAGTCG ACCCTCGCCT TTGACACCAT CTATGCCGAA GGCCAGCGGC GCTACGTGGA ATCCCTCTCT GCCTACGCCC GCCAGTTCCT GGAGCAGATG GAAAAGCCGG ATGTGGAATC CATAGAGGGT TTGAGCCCGG CTATTTCCAT CGAGCAGAAG ACCACGAGCA AGAACCCCCG TTCCACGGTC GGCACCGTTA CCGAAATCTA CGACTATCTG CGCCTCCTCT TTGCCCGGGT AGGCAAACCC CACTGCTACA GCTGCGGCAA GGAGATCACC GCCCAGACCG TCTCCCAGAT GGTGGACCAG ATCATGGCCA TGCCAGAGGG GACGCGGATC AACCTCCTCT CCCCCATGAT CCGCGGCCGC AAGGGCGAAT ACCGGAAGGA GCTGAACCAA CTGAGGAAGG ACGGCTTTGT CCGCGTCATC ATCGATAATG TTCCCCATGA CCTCGCAGAG GAGATCACCC TCGACAAAAA CAAGAAACAC GACATCGATA TCGTCGTCGA CCGGCTCATC GTCAAGGAAG GCATTCAGCG CAGGCTGGCA GATTCACTGG CAACGGCATT GAACCACGCC GAAGGGATCG TCAAGGTCGC CATACAGGAA AATCCTCAGG CTGATTCCGC GGCAGAGCCT GCCAACGGCA AGGGCAAAAA AGCTGCGGGC AAGAAGTCGC TCTGGGCGGA TACCATGCTC TTTTCCGAGA GCTTCGCCTG CATCGACTGC GGCATCTCTT ACCCGGAAAT GACGCCGCGC ATGTTCTCCT TCAACAACCC CTATGGCGCC TGCCCTGACT GCACGGGCCT CGGGACGCGC ATGTACTTCG ATGCCGAGCT GGTGATTCCC AACCCGGATC TCTCCATCCG CGAGGGGGCC ATCGCCCCGT GGGAAAAGCG GCTCTCCGGC TGGTATCACC AGACACTGGA AGCCCTGGCC AAGGCGTACG ACTTCGACAT CCGCACCCCG TTCAAAAAGC TTCCGACAAA GGCCCGGGAT ATCATCCTTC ACGGCTCCGG GGGGGATAAG GTCGAGTTCT GGTGGGAGGA GGATGGCGGG CGAAAACATA TCTACCAGAA GGAGTTCGAG GGGGTGCTCA ACAACCTGGA GCGCCGTTAC CGGGAGAGCG AGTCCGACCA GGTGCGGGAA GAGCTGGAAA AATACATGAA CATCATGCCC TGCCCCACCT GCAAGGGTGC CCGGCTGAAG AGGGAAGCGC TTTTCGTCCG GATCGACGGC CACAATATCT GCGACGTCAC CGCCCTCTCC ATCAAGGACT GCCTGGAGTT TTTCGCTAAT CTGCACCTGA CTGAAAAGGA GGAGGAGATC GCCCGCCGCA TCCTCAAGGA GATCCGGGAG AGGCTCCATT TTCTCGTCAA TGTGGGGCTC GACTACCTGT CGCTCGACCG TTCGTCCGGC ACCCTTTCCG GCGGGGAAGG GCAGCGGATC AGGCTGGCGA CCCAGATCGG CTCCAGCCTC GTCGGCGTTC TCTACATCCT CGACGAGCCG TCCATCGGCC TCCATCAGCG GGACAATGCC CGCCTCCTCC AGACCCTCAA GCATATGCGC GACCTGGGGA ACACGGTGCT CGTGGTGGAG CACGACGAAG AGACGATCCT TGAAGCGGAT CACGTGATAG ACATGGGGCC GGGTGCGGGA GTCCTCGGCG GGCGTGTGGT GGCCCAGGGA ACGCCGGCGG AAATCATGGA AAACCCCGAT TCCCTTACCG GCAAGTACCT GTCCGGCAAA CTTGCCATCG CGGTGCCGAA ACTGCGGAGA AGGCCGGCTA AGTTCCTGCG CATAACCGGC GCCACGGAGA ACAACCTCAA TGACGTCGAG GTGGATATCC CGCTCGGCGT GTTGACCTGC GTGACCGGCG TGTCCGGCTC AGGCAAATCG ACGCTGGTCA TCGACACCCT CTACAAGGTA CTCTGCCAGC GGCTTTACCG GAGCAGGGAG AAAGCAGGGG CGGTGAAACA GATCACCGGG CTTGAGGCGC TCGACAAGGT GATCAACATC GATCAGTCAC CGATCGGCCG AACTCCGCGC TCCAACCCCG CCACCTACAC CGGCGTCTTT GCCGATATCC GCGACATCTT CGCCCAGCTC CCCGAATCGA AGATGCGTGG CTACAAGCCG GGGCGCTACT CGTTCAATGT CAAGGGGGGG CGTTGCGAGG CCTGTTCCGG GGACGGCATC ATCAAGATCG AAATGCACTT CCTCCCCGAC GTCTATGTGC AGTGCGAGGT CTGCAAGGGG GCCCGCTACA ACCGGGAAAC ACTGGAGGTG CGCTACAAGG GGAAATCCAT CGCCGAGGTC CTGGACATGA CCGTCTCCCA GGCGCTGCAG TTCATGGAAA ACATCCCGCG GATCAAGAAC AAGCTGAAAA CCCTGGAGGA GGTCGGGCTC GGTTACATCA AGCTCGGCCA GTCCGCCACC ACCCTTTCGG GAGGGGAGGC CCAGCGCGTC AAGCTGGCCA AGGAGCTCTC CAAGCGGGCC ACTGGCCGGA CCATCTACAT CCTCGATGAA CCGACCACCG GCCTCCATTT CGCCGACATC CAGAAGCTCC TGGAAGTGCT GGAAAAGCTC GTCGAGGGGG GCAACACCAT CGTCATCATC GAACACAACC TGGACGTGAT CAAGACAGCC GACTACCTCA TCGATCTGGG ACCCGAGGGG GGTGACCGGG GCGGCGAGGT GATTGCAACC GGCACACCGG AGGAGGTGGC ACGGGTGACG CGATCCTATA CGGGGATGTT CCTGCGGAAA TTGCTGTAA
|
Protein sequence | MATDKIIIKG ACEHNLKCID VEIPRDQLVV ITGISGSGKS TLAFDTIYAE GQRRYVESLS AYARQFLEQM EKPDVESIEG LSPAISIEQK TTSKNPRSTV GTVTEIYDYL RLLFARVGKP HCYSCGKEIT AQTVSQMVDQ IMAMPEGTRI NLLSPMIRGR KGEYRKELNQ LRKDGFVRVI IDNVPHDLAE EITLDKNKKH DIDIVVDRLI VKEGIQRRLA DSLATALNHA EGIVKVAIQE NPQADSAAEP ANGKGKKAAG KKSLWADTML FSESFACIDC GISYPEMTPR MFSFNNPYGA CPDCTGLGTR MYFDAELVIP NPDLSIREGA IAPWEKRLSG WYHQTLEALA KAYDFDIRTP FKKLPTKARD IILHGSGGDK VEFWWEEDGG RKHIYQKEFE GVLNNLERRY RESESDQVRE ELEKYMNIMP CPTCKGARLK REALFVRIDG HNICDVTALS IKDCLEFFAN LHLTEKEEEI ARRILKEIRE RLHFLVNVGL DYLSLDRSSG TLSGGEGQRI RLATQIGSSL VGVLYILDEP SIGLHQRDNA RLLQTLKHMR DLGNTVLVVE HDEETILEAD HVIDMGPGAG VLGGRVVAQG TPAEIMENPD SLTGKYLSGK LAIAVPKLRR RPAKFLRITG ATENNLNDVE VDIPLGVLTC VTGVSGSGKS TLVIDTLYKV LCQRLYRSRE KAGAVKQITG LEALDKVINI DQSPIGRTPR SNPATYTGVF ADIRDIFAQL PESKMRGYKP GRYSFNVKGG RCEACSGDGI IKIEMHFLPD VYVQCEVCKG ARYNRETLEV RYKGKSIAEV LDMTVSQALQ FMENIPRIKN KLKTLEEVGL GYIKLGQSAT TLSGGEAQRV KLAKELSKRA TGRTIYILDE PTTGLHFADI QKLLEVLEKL VEGGNTIVII EHNLDVIKTA DYLIDLGPEG GDRGGEVIAT GTPEEVARVT RSYTGMFLRK LL
|
| |