Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_2038 |
Symbol | |
ID | 8753709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 2115164 |
End bp | 2118055 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003409097 |
Protein GI | 284990543 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.225095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCGCC TCGTCGTCCG CGGCGCCCGC GAGCACAACC TCAAGGACGT CAACCTCGAC CTGCCCCGCG ACGCGCTGAT CGTGTTCACG GGGCTGTCCG GCTCGGGGAA GTCCAGCCTC GCCTTCGACA CGATCTTCGC CGAGGGTCAG CGCCGCTACG TCGAGTCGCT GTCGGCCTAC GCCCGCCAGT TCCTGGGCCA GATGGACAAG CCCGACGTCG ACTTCATCGA GGGGCTCTCA CCGGCCGTCT CGATCGACCA GAAGTCCACC AACCGCAACC CGCGGTCGAC CGTCGGCACG ATCACCGAGG TCTACGACTA CCTGCGGCTG CTCTACGCCC GCGCCGGCCA GCCGCACTGC CCCAACTGCG GCAAGCCGAT CTCCCGGCAG ACCCCGCAGC AGATCGTCGA CCAGGTGCTG GCGATGGAGG AAGGCACCCG CTTCCAGGTG CTCGCCCCCG TCGTCCGGGC CCGCAAGGGC GAGTACGTCG ACCTGTTCAG CTCGCTGCAG ACCCAGGGCT TCTCCCGGGT CCGCGTCGAC GGCGTCGTCC ACCAGCTGAC CGACCCGCCG AAGCTGAAGA AGCAGGAGAA GCACACGATC GAGGTGATCG TCGACCGGCT CACCGTCAAG GAGAGCGCGA AGCGGCGGCT CACCGACTCG GTCGAGACGG CGCTGGGCCT GGCCGGCGGT CTCGTCGTCC TCGACTTCGT CGACCTGCCC GAGGACGACC CGCAGCGCGA GCGCACCTTC TCCGAGCACC TGGCCTGCGT GGACGACGGG TTGTCCTTCG AGGCGCTCGA GCCGCGGTCG TTCTCCTTCA ACTCGCCGTT CGGCGCCTGC CCCGAGTGCA CCGGCATCGG CACCCGCAAG GAGGTCGACC CCGACCTCGT CGTGCCGGAC TCCGAGAAGA GCCTGGCGCA GGGCGCGATC GCCCCGTGGG CGACCTCGAT GAGCAACGAG TACTTCACCC GGCTGCTCAC CGGTCTCTCG CAGCAGCTCG GCTTCTCCAT GGACGACCCG TGGGGGCGGC TGCCGGCCAG GGTGCAGAAG GCGGTCCTGC ACGGCTCGCC GGACCAGGTG CACGTCCGCT ACAAGAACCG CTACGGCCGC GAGCGCAGCT ACTACGCCGC CTTCGAGGGC GTGCTGCCCT TCCTCGAGCG GCGGCACGAG GACACCGACA GCGAGTACAT GCGGGACAAG TACGAGGGCT ACATGCGCGA CGTGCCCTGC CCCGTCTGCC ACGGCACCCG GCTCAAGCCC GAGATCCTCG CCGTCAAGCT CAACGGCCGT TCGATCGCCG AGGTCACCGG CCTGTCCATC GGCGACGCCT CCGGATGGCT GAACAGCCTG GAGCTCGGCG AGCGCGAGCG GGCCATCGCC GACCGCGTGC TCCGGGAGAT CCAGGCCCGG CTGTCCTTCC TGGTCGACGT CGGCCTGGAC TACCTGTCGC TGGACCGGCC GGCGGCGACG CTGGCCGGCG GCGAGGCGCA GCGGATCCGG CTGGCCACCC AGATCGGGTC CGGACTGGTC GGCGTCCTGT ACGTGCTGGA CGAGCCCTCG ATCGGGCTGC ACCAGCGGGA CAACACCCGG CTGATCGAGA CCCTCGTGCG GCTGCGCGAC ATGGGCAACA CGCTGATCGT CGTCGAGCAC GACGAGGACA CCATCAAGAC CGCCGACTGG GTCGTCGACA TCGGCCCCGG CGCCGGTGAG CACGGCGGCG AGGTCGTCGT CAGCGGCACC GTCGAGGAGC TGCTGGCCAG CGAGCGCTCG CTGACCGGGC AGTACCTCTC CGGGCGCAAG GAAATCGCCG TCCCGCAGGT GCGCCGCCAG CCGACGCCCG GCCGGGAGCT GGTCGTCAAG GGCGCCCGCG AGCACAACCT CAAGGGCGTC GACGTCACCT TCCCGCTGGG CCTGCTCGTC GCGGTCACCG GCGTCTCGGG CTCCGGGAAG TCCAGCCTGG TCAACGACAT CCTCTACACG ACCCTGGCCA ACGAGCTGAA CCGCGCGCGG ATGGTGCCCG GCCGGCACCG CACCATCACC GGCCTGGACC AGCTCGACAA GGTCGTGCAC GTCGACCAGT CGCCCATCGG CCGCACCCCG CGGTCCAACC CGGCGACCTA CACCGGCGTC TGGGACCACG TGCGCAAGCT GTTCGCCAGC ACGTCGGAGG CGAAGATCCG CGGCTACCAG CCCGGCCGGT TCTCCTTCAA CGTCAAGGGC GGTCGCTGCG AGGCCTGCTC CGGCGACGGC ACGCTGAAGA TCGAGATGAA CTTCCTGCCG GACGTCTACG TGCCGTGCGA GGTGTGCAAG GGCGCCCGGT TCAACCGGGA GACCCTCGAG GTGCACTACA AGGGCCGGAC CGTCGCCGAG GTCCTGGACA TGCCCATCGA GGAGGCGGCC GACTTCTTCG CCGCCATCCC GGCCATCTCC CGGTACCTGC GCACGCTGAC CGAGGTCGGG CTGGGGTACG TCCGGCTCGG CCAGCCGGCG ACCACCCTCT CCGGCGGCGA GGCGCAGCGC GTCAAGCTGG CCAGCGAGCT GCAGAAGCGG TCCAACGGCC GCAGCATCTA CGTGCTGGAC GAGCCCACCA CCGGGCTGCA CTTCGAGGAC ATCCGCAAGC TGCTGATCGT GCTGCAGGGC CTCGTCGACA AGGGCAACTC CGTCATCGTC ATCGAGCACA ACCTCGACGT CATCAAGAGC GCCGACTGGC TGATCGACAT GGGCCCCGAG GGCGGCTTCC GCGGCGGTAC GGTCGTCGCG GAGGGGCCGC CGGAGTTCCT CGCCACGGTC CCGGAGAGCC ACACCGGCCG CTACCTGGTC CCGGTGCTCG CCCCGGACGC GGTCGCGGCC GCGGCGGCGC CGAAGAAGCG GGTCCGCAAG AAGGCCAGCT GA
|
Protein sequence | MDRLVVRGAR EHNLKDVNLD LPRDALIVFT GLSGSGKSSL AFDTIFAEGQ RRYVESLSAY ARQFLGQMDK PDVDFIEGLS PAVSIDQKST NRNPRSTVGT ITEVYDYLRL LYARAGQPHC PNCGKPISRQ TPQQIVDQVL AMEEGTRFQV LAPVVRARKG EYVDLFSSLQ TQGFSRVRVD GVVHQLTDPP KLKKQEKHTI EVIVDRLTVK ESAKRRLTDS VETALGLAGG LVVLDFVDLP EDDPQRERTF SEHLACVDDG LSFEALEPRS FSFNSPFGAC PECTGIGTRK EVDPDLVVPD SEKSLAQGAI APWATSMSNE YFTRLLTGLS QQLGFSMDDP WGRLPARVQK AVLHGSPDQV HVRYKNRYGR ERSYYAAFEG VLPFLERRHE DTDSEYMRDK YEGYMRDVPC PVCHGTRLKP EILAVKLNGR SIAEVTGLSI GDASGWLNSL ELGERERAIA DRVLREIQAR LSFLVDVGLD YLSLDRPAAT LAGGEAQRIR LATQIGSGLV GVLYVLDEPS IGLHQRDNTR LIETLVRLRD MGNTLIVVEH DEDTIKTADW VVDIGPGAGE HGGEVVVSGT VEELLASERS LTGQYLSGRK EIAVPQVRRQ PTPGRELVVK GAREHNLKGV DVTFPLGLLV AVTGVSGSGK SSLVNDILYT TLANELNRAR MVPGRHRTIT GLDQLDKVVH VDQSPIGRTP RSNPATYTGV WDHVRKLFAS TSEAKIRGYQ PGRFSFNVKG GRCEACSGDG TLKIEMNFLP DVYVPCEVCK GARFNRETLE VHYKGRTVAE VLDMPIEEAA DFFAAIPAIS RYLRTLTEVG LGYVRLGQPA TTLSGGEAQR VKLASELQKR SNGRSIYVLD EPTTGLHFED IRKLLIVLQG LVDKGNSVIV IEHNLDVIKS ADWLIDMGPE GGFRGGTVVA EGPPEFLATV PESHTGRYLV PVLAPDAVAA AAAPKKRVRK KAS
|
| |