Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_0539 |
Symbol | |
ID | 3744418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 630807 |
End bp | 633677 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637768578 |
Product | excinuclease ABC subunit A |
Protein accession | YP_374466 |
Protein GI | 78186423 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.111546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.557 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAATC ATAGGCTCAA GGGTTCGGCG CCGGCTGAAC CGAGTCTGCC GGATATAGTC CTCAAAGGCA TCACCACCCA CAACCTCCAG AACATCTCCG TCCGTATTCC TCGAAACCGG TTCGTGGTGC TCACCGGCGT CAGCGGCTCG GGCAAATCGA GCCTTGCTTT CGATACCCTT TATGCCGAGG GGCATCGTCG CTATGTCGAG TCGCTCTCGG CCTATGTGCG CCAGTTTCTC GAGCGCATGC CGCGCCCCGA AATCGAGGTG GTTGAGGGGA TCGCTCCGGC CATTGCCATA GAACAGCGCT CCATACCGCG CAACCCCCGC TCGACGGTCG GTACCGTTTC GGAGATATAC GACTACCTTC GCCTGCTTTA TGCCCGCATC GGCAAGATCT ATTCCCGCGA CACCAATGAA CTGGTGCTGA AGCATACCCC GGACGATGTG GTCATGCAGG CCGGCTTTTT TCTGGAAGGA ACGAAGTTCT ATGCGGGATT CCTTTTTCCT TCCCATGAGG ACGGCAGTCA TCATCTCTGC ACAGTCGATG AGGAAATTTC AAACCTGCTG AAGAAGGGAT TTTTCCGGGT GGTGTCCGGC GACACCGTAC TCGACCTTAA CAGCCCGGAC GACTGCAGGA AGGTCGCCAT GATGAGCGAA GCCGCCCGCA GAGCACTGCT TGTGCTTGTC GACCGGTTTG TGACCCGTCA TGACAAGAAG TTCCAGAGCC GCGTTGCCGA GGCGGCTGAG TCCTGCTTCA GCGAGTCGGG CGGCCATGCG GTGCTGCAGG TGGTCGGAGG GAAGACCTAC CGGTTCAGCG ACCGGCTGGA GCTCAATGGC ATTGAGTACC AGGAGCCCTC CCCGCAGCTG TTCGCCTTCA ACTCCCCGAT CGGTGCCTGC ACCACCTGCC AGGGATTCGG GCGCATCGCC GGCATTGATG AACATGCCGT CGTTCCCGAC CGTTCGCTGA CGCTTGCAGA AGGTGCAATC GTCTGCTGGA ACTCGGAAAA GTACCGATGG AACCTCGAGG AACTGCTTCT TGCGGCTCCG AAAGCCGGCA TCCCGGTCGA TGTTCCCTAT GAAAAGCTGT CGCATGCAGA AAAGGAGCGG ATCTGGAAGG GTGTGCCGGG CACCGGGTAC CGCGGCATAT GGCCGTTTTT TGCTGAAATC GAGAAAGATG CCGGCTATAA GATGCATTTC AGGGTGTTCC TCAGCCGTTA CCGCGGCTAT GCGATCTGCC CCGACTGCGA AGGCTCCCGT CTCAACCCCG ATGCCCGCCT TGTCCGTGTA TCCGGCCGGA ATATCACTGA TGTGGCCCGT ATGAACATCG CCGATGCCCA TGCGTTCTTT AAAAACCTTG AGATCTCTCC GTTCGACCGC AAGGTGGCAG AATCGGTGAT CGAGGAGATC GAGAAGCGGC TCGGCTACCT GCTTGACGTG GGCCTCGACT ACCTGACGCT CGACCGCCTT ACCCACACCC TCAGCGGCGG CGAATTCCAG CGCATCAACC TTTCAACTTC GCTCGGTTCT CCTCTGGTCG GCGCCATCTA CATTCTCGAC GAGCCGAGCA TCGGTCTGCA CCAGAGCGAT TCGGCCCGCC TCATCACACT GCTCCAGAAA CTTCGCGATC TCGGCAACAC GGTCGTCGTT GTCGAGCACG ATCGCGAAAT CATCGAGGCG GCCGACGAGG TGATCGATCT CGGTCCCCAG GCCGGCAGGC TCGGCGGAGA GGTGGTGTTC CACGGTTCAG TCGAGGAGAT GAAGCGTTCC GGCACTTCAA TGACGGCCGA GTACCTGCAG GACCGCCGTT CCATTCCGTT GCCGGAAGTG CGCAGGACTC CGGACTTTAC GGCCTGCATC GAAATCAACG GGGCCATGCA GAACAACCTT AAGAACATCG ACGTGCGTTT CCCGCTTGGT GTCATGACCT GCGTCACCGG CGTGAGCGGT TCTGGCAAGT CGACGCTCGT CAACGACATC CTCAACCGGG GCCTCATGCG CGAGAAGGAG GGTGTGAAGG ACGAGGTGGG CACCCACCGC GCGATTACCG GCGGGTGGCA GATAGACCGT GTCGAGCATG TGGACCAGTC GCCCATCGGC AAGTCAAGCA GGAGTAACCC GGTGACCTAT CTGAAGATCT TCGACGATAT CCGCTCCGTA TTCGCCCAGA CTCCCGACGC GAGAGCACGG GGGTGGAAGG CGGGGTATTT TTCGTTCAAT ATTCCCGGCG GCCGCTGCGA GGCCTGTTCG GGAGAGGGGA GCGTGAAGAT CGAAATGCAG TTTCTCGCCG ACATCGAAGC GGTCTGCGAG GAGTGCGGCG GACTGCGCTA CAAGGCCGAT ACGCTGGAGG TGAAGTACCG CGGCCGTTCG ATTGCAGAGG TGCTCGATAT GACCGTGGAA GAGGCGCTCG TGTTTTTCGC CCGGGAAAAA AGCATATCCC GGAAGCTCGG TGTGCTTGAC GAGGTCGGAC TCGGCTATAT CCGCCTCGGC CAGTCCTCAA GCACCCTTTC CGGCGGCGAA GCCCAGCGCC TGAAGCTCGC GAGCTTCATT GCGCGGGCCG ATGTCGAGCA CACCCTCTTC CTTTTCGACG AGCCGACAAC AGGCCTGCAT TTTGAGGATA TCCGCAAACT TCTGGGGTGT TTTTCGAAGC TGCTTGAGCA ACACAACACC CTCATCGTCA TCGAACACAA CCCCGACATC ATTTCCCAGG CGGACTGGGT GATCGACCTC GGTCCGGGTG CGGGAGACCG GGGGGGCCAG GTGATCGGCG AAGGTACGCC GGAAGATCTT GCCGGGATGG AGCAGTCGCT CACCGGCCGT CACCTGCAGC CGCTCTTCAG GAGGCTCGGA AAGACGAAGA AGGTGACCTA G
|
Protein sequence | MTNHRLKGSA PAEPSLPDIV LKGITTHNLQ NISVRIPRNR FVVLTGVSGS GKSSLAFDTL YAEGHRRYVE SLSAYVRQFL ERMPRPEIEV VEGIAPAIAI EQRSIPRNPR STVGTVSEIY DYLRLLYARI GKIYSRDTNE LVLKHTPDDV VMQAGFFLEG TKFYAGFLFP SHEDGSHHLC TVDEEISNLL KKGFFRVVSG DTVLDLNSPD DCRKVAMMSE AARRALLVLV DRFVTRHDKK FQSRVAEAAE SCFSESGGHA VLQVVGGKTY RFSDRLELNG IEYQEPSPQL FAFNSPIGAC TTCQGFGRIA GIDEHAVVPD RSLTLAEGAI VCWNSEKYRW NLEELLLAAP KAGIPVDVPY EKLSHAEKER IWKGVPGTGY RGIWPFFAEI EKDAGYKMHF RVFLSRYRGY AICPDCEGSR LNPDARLVRV SGRNITDVAR MNIADAHAFF KNLEISPFDR KVAESVIEEI EKRLGYLLDV GLDYLTLDRL THTLSGGEFQ RINLSTSLGS PLVGAIYILD EPSIGLHQSD SARLITLLQK LRDLGNTVVV VEHDREIIEA ADEVIDLGPQ AGRLGGEVVF HGSVEEMKRS GTSMTAEYLQ DRRSIPLPEV RRTPDFTACI EINGAMQNNL KNIDVRFPLG VMTCVTGVSG SGKSTLVNDI LNRGLMREKE GVKDEVGTHR AITGGWQIDR VEHVDQSPIG KSSRSNPVTY LKIFDDIRSV FAQTPDARAR GWKAGYFSFN IPGGRCEACS GEGSVKIEMQ FLADIEAVCE ECGGLRYKAD TLEVKYRGRS IAEVLDMTVE EALVFFAREK SISRKLGVLD EVGLGYIRLG QSSSTLSGGE AQRLKLASFI ARADVEHTLF LFDEPTTGLH FEDIRKLLGC FSKLLEQHNT LIVIEHNPDI ISQADWVIDL GPGAGDRGGQ VIGEGTPEDL AGMEQSLTGR HLQPLFRRLG KTKKVT
|
| |