Gene Information |
Locus tag | Achl_1331 |
Symbol | |
ID | 7292778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 1480928 |
End bp | 1487050 |
Gene Length | 6123 bp |
Protein Length | 2040 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643589737 |
Product | Fibronectin type III domain protein |
Protein accession | YP_002487410 |
Protein GI | 220912101 |
COG category | |
COG ID | |
TIGRFAM ID | |
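
The coordinate and length fields above are consistent with each other. The short sketch below checks this, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming it ends with one untranslated stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1480928, 1487050)
print(length)                     # 6123
print(protein_length_aa(length))  # 2040
```

Both results match the Gene Length (6123 bp) and Protein Length (2040 aa) rows.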
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000000211706 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACAAGTC TGCTGGGGAA ACTTGGGCTC AAACGGCGCC ACAAGAAGAT CGTCACCGGT ACAGCCTTCG CCGCTGCCGT GGCCGTCGTG GCGACCGGCG CCGTGCTCTA TCCGGGGTTT AAGACCACTG AGGTGGAGTT GAATGACGGT GGTGTGTGGG TGGTGTCGAA GTCCAAGAAT GCGGTGGGGC GGTTGAATTA TCCGTCGCGT GTGTTGGATG GGGCGGTGAC TCCGGCGTCG TCGACGTTTG ATATTTTGCA GGATGCCGGT GAGGTGTTTG TTGATGATGA GTCTGGTTCG ACGTTGAATC AGGTGTCGCC GGCGAATATG CGTTTGGGTG GGGATAAGCA GTTGCCGGGG GCTGCGGATG TGAGTTTTGG GTCGTCGGTG TTGTCGGTGA CGGATGCGGC GTCGGGCAAG GTGTGGGCGG TGTCGCCGTC GACGGTGAAT GGTTTTGATG AGGAAGCGTC GGAGCCGGTG TTGGTGGGGT CTGAGGGTAC GGTGTCGGCG GTGGGTGCTG ATGATCGTAT TTATTCGGCG GATCCGAAGG CCGGGACGGT GACGGTGACC GGTGTTGATG CCAATGGTGT GGTGGTGTCT TCGGATGCTG AGTCGTGGTC TGAGTTGAAG GGTGCGGGGG ACCTGCAGAT TACGGTGGTG GGGGACCGGC CGGTGGTGTT GGATGCTGCG GCGGGGAGTT TGTTTTTGCC TGGTGGTAAG CGGTTGCAGT TGGCTGATGC GCGGGATGCG AAGTTGCAGC AGGGTGGTCC GGGCAGTGAT TTTGTGGCGA TTGCGACGCA GAAGGCGTTG TTGAAGCAGC CGTTGGATGG TGGGACGGCG AAGACGGTGT CGTTTGGTGG TGAGGGTGTT CCTGCGGCTC CGGTGCAGTT GGGTGGGTGT GTGCATGCTG CGTGGTCGGG GGCGAATAAG TATGTCCGTG ATTGTGTGAA TGATGCTGAT GATAAGAACG TGGATGTGCC CAAGGCGAGT GCGTCGCCGT CGTATGTGTT TCGGGTGAAC CGGGACCTGG TGGTCCTCAA TGATGTGAAT TCCGGGAATG TGTGGCTGGT GAACCAGAAC ATGCAGCTGG TCAACAACTG GGACGACGTC GTCCCGCCGA AGAACGAATC CGACGAGCAG GACCAGGAGT CCGCGGATAT CAACACCATC AACGTGTTGC CGGACCGCAC CAAACCGAAC CGTCCGCCGG AAACCAAGCC CGACGTCGTC GGCGTCCGTC CGGGCCGCAC CACCATCCTG AGCGTCCTGG ACAATGATTC CGATCCCGAC GGCGACGTCC TCACCGCCGG GCTCCAGGGC AACCCGCCGA AGGCAGGGAC GCTGGAGAAC ATCTACGGCG GCACCGCCTT CCAGATTTCC GTGCCGGCCG ACGCCAGGCC CGGCACGGAG ACCTTCAGCT ACTCGGCGTC CGACGGCCGC GGCCTCTCCG CCACCGGACA GGTCACCCTC AACGTGGTGG GCCCGGACCA GAACAAGCCA CCGCAGTTCA AACGCGGCGA GAACACCACC ATGCTGGTGG AACAGGGCAA GACCGTGAGC CAGAACATTC TGACGGACTG GGTGGACCCC GACGGCGATG ACCTGGTGCT CCTCGACGCC AAGGCCGACA ACGAGCAGGA CCAGGTGAAG GTCCGCCGTG ACGGCCTGCT CACTTTCCAG GACTCCGGCG CCACCTCCGG CAAAAAGAAC GTTGAGGTGA CCATCTGGGA TGGCCGCGAC ACCGTCACCG GAAAGGTGGT CATCAACGTG CAGCCGCCGG GTGCCCTCGC ACCGGTAGTC 
AACGCTGACC ATGTCACAGC CGTAGTGGGC CAGGACCTGG TGATCTCGCC GCTGAAGAAT GACGTGGACC CCAACGGCGG CGCGCTCCGC CTCGCCCAGG TGGAAGCGAA CGGGCCCGCC GATCTCGGCC CCGTGACCGA CGGCGGAACC TTCACGTTCC GCAGCACCAC CCCCGGCCCC GTCTACCTCA CATACATCGC CAGCAATGGC CCGCAGAGCA GCCAGGGCCT GATCCGCGTG GACGTCGAAT CCGGTGACGA TCCCGGCGAC CCCGTCGCTG TCCACGATGT CGCCCTGATG CCCACCGGCG GCAGCGTCCT GCTGGACCCG CTGGCAAACG ACTCCGACCC CTCCGGCGGC GTCTTGGTGC TGCAATCCGT GAAGCTTCCG GAAAACGCCA CCGTCTCAGT CAGCGTGATC AACCACAGCG TCCTGCGCAT CACGGACATC CTCGGAACCA AGGACCCGAT CCTCTTCGAA TACACCATGT CCAACGGCAA GAAGTCGGCC ACAGGTTCCG TCTCCGTGGT ACCCGTCCCG GCACCGGCCG TGGTGGAAGC ACCCCAGCCC AAGCCGGACG AGGTCAATGT CCGCGTCAAC GACGTGGTCA CCATTCCCGT CCTGGACAAC GACACGCACC CGCAGGGCCA GGAGCTGACC GTCGATCCGG TGCTCCCGCA GGGTGTGGAC CCGGTCGACG GCAAAAGCTT CGTCTCCGAA AACACCCTCC GGTTCATCGC CGGCAGCCAG CCCAAGACGG TCCGCGCCAT CTACAATGCG GTGGACCCGC AGGGCCAAAA GAGCGCCGCC GCCGTCACCA TCCACATCCT CCCGCTCGAA GGCGCGGAGA ATTCCCGCCC GCAGCCGCGC AACCTCACGG CCCGCGTGGT GGCGGCCGGC ACCGTCCGCA TTCCGGTGCC GCTGGACGGC ATCGATCCCG ACGGCGACTC GGTCCAGCTG ACCGGCATCG ACAGCACGCC TGCCATGGGT ACGGCCACCG TGGGCAGCAA CTTCATCGAC TTCACGGCTG CCGGAGACGG CGCCGGGACC GATACCTTCC GCTACAAGGT GGTGGACCGG CAGGGTGCCG TCAATACGGG CACCGTCACC GTGGGCATTG CACCGCGCGG CGAAATCAAC CAGAAGCCCA CTCCCGTGGA CGACGAGGTG CGGGTCCGCC CCGGCCGCCA GATCGCCGTC GACGCCACCG GCAACGACAC CGATCCCGAC GGCGACCAGA TCCGCATCCT CACCGACGGC ATCGAGGCTG ATCCTGCCCT CCAGGCCACC GTGAGCAAGA CCAGCGGACG GGTCATCCTG ACCGCCCCCA ACGAGGCCGG GACCGTCAAC GTCCGGTACA CCGTGGCCGA TGACCGGGAC GCCCGGGCAC AGGCCACCAT CCGCGTGGTG GTGGACCCGG AGGTCCCGCT CAAGGCACCC ATCGCCCGTG ACGACCGTGT GACATCAGCC CAGACCATGG GCAAGACCGC CGTGGACGTT CCCGTCCTGA AGAACGATGA AGACCCCGAC GGCGTGGGCG AAAACCTCAA GATCAGTACT GAGGCAACCA CGGCCCGGCC CGGCGCTGAA GGCAACATGA TCGTTGACCT CACCGAGCAG CCGCAGCTCA TCCCGTACAC CGTGGAGGAC GTTGACGGCC AGAAGTCCAC CGCCATCATC TGGGTGCCGG GCCTCGGCCA GCAGGTTCCC ACTCTGGCAA AGGACGAGGT CCTGGAAGTG GTGGCGGGAC AGACCGTCAA TGTTGACCTG GACGAATGGG TCAAGGTCCG GGAAGGCCGC TCGCCGCGCC 
TGACGCAGGC GGACCGGATC AAGCTCATCG GAGCCGACGG CAGCGACCCC GTCACCGGTG ACGGCACGGG ACTGAAGTAC ACCGCAGGCA CTGACTACGT GGGTCCGGGT TCACTGACCT TCGAAGTCAC CGACGGCACC GGACCGGACG ATCCCGCCGG CCTGAAGTCG ACGCTCAGCA TCCGCACCAA GGTCCTTCCG GACCCCAACA AGAACAACCC GCCTGAACTC CTGGGCGCGA ACGTCGAGGT GCCCATGGGC GACTCGGCAA GCATCGACCT CGGGAGGCTG ACCTCGGACC CCGACGGGGA CGACGTGGAC AACATGAAGT ACGAGCTGGT GGGCGGTGGT GCCGCCGGCT TCAACGCGAG CGTGGACGGC AGGAACCTCA AAGTCTCCGC CGCGGACTCC AGCCGGGCCG GCACCGCCGG TGCCGTGCAG GTCAAGGCCA GGGACCCGCG CGGCCTGGAA GCCACGGCCA CGTTCCAGCT GTCCGTGACG GCATCCAACC GGCCCAAACC GGTAGCTAAC GACGACGTTG AACCTAACGC CGCCGCCGGC AAGCCCGTCA CCATCAATGT CCTGGCCAAC GACGCCAACC CGTTCCCCGA GACGGCGCTG AAGATCATTG CCGCGGCTAC CGAGACCGGC AGCGGCAACG TTGAAGTGAA CGGCGATTCG GTGACGGTCA CACCCGCCCC CGGCTTCACC GGCACCATGG TGGTCACCTA CACGGTGGAA GACAAGACCC AGGACGCATC GCGGCACGCC ACGGCACGCG TCCGGCTTAC GGTGAAGGAC AAGCCGGCGG CCCCCACTAC GCCGCAGGCG CAAAGCGTGG GGGACCAGAC GGCGCTGCTG ACCTGGGCCG CTCCCGCGGA CCGCGGTTCA CCCATCACCA AGTACACCGT GTACGGCGAG GGCGGATTCA AACAGGACTG CCCGGCCAAT ACCTGCACGC TGAACGGGCT GGTCAACAAC ACGAAGTACC ACTTCCAGGT CACCGCCACC AACGAGTTCG GCGATTCGGA CCGTTCGCCG GCTTCGGCCG AAGTACGGCC GGACGTCAAG CCCGACACCC CGCTGGCGCC GTCGCTGAAG TTTGGCGACA AGGAACTGTC CATCAACTGG ACTGCTCCTG CCAGCAAGGG TTCGCCGGTA AAGTCCTACG ACCTGGAGAT CTCCCCGCCG CCCGCGGGGC AGAACGCCCA GATCCAGAAC CTGACCGCAG TCAGCTATGT CTGGAAGGGC CTGCAGAACG GCGTGTCCTA CAAGGTGCGG GTCCTGGCGC GGAACGACGC CAAGGAACCG TCCGAGTGGA GCCCGTACTC CGCGGCTGAA GTGCCGGCCG GTGTGCCGGC CACCCCGGCA GCACCCACGG CCGCCCAGGC GGCGTCCCTG GGGTCGCAGA GCCAGCTGAA GGTCAGCTGG GCCGCGCCGA ACAACAACGG TGACGCCGTC TCTGCCTACA CCCTCACCAC CCTCCGGGGC GGTGCGGTGG TGGCCACCCA GCAGGTGGCC GGCACCTCGC AGAACGTGAC AGTGGACAAC TCCGAGGCCA ACTACACCTT CACGGTCTCC GCCACCAACA AGGCGGGAAC CAGCGCCACC AGCGGACAGT CGGCGGCCAT CCGGGCGGTG GGCAAGCCCG GCATCGTTGG GACTCCGACG GCGACGCTGG TGGACACCGG TGGAAACGGC GGCAAGATCG ACGTGAGGTT CCCGGTGCTG AGCGATGCCC AGCGCAACGG GTCCACCCCG CAGGAAATCA CCTACAGGTA CAGCCTGACC TCAGGCGGCG GCAGCGGCAG 
CATCGCTGCC GGCGGCGGGA CGGTTGCCGC CGCCAATGGC ACGCCTACCT CGGTGGTGGT CTGGGCGGTC TCGTCCCGCA GTTCCACCGC CGGTGACGCC AGCGCTCCGT CCAACCAGGT GAATCCGTAC GGCCTGGCCT TTGCTCCTAA CGTGAACGGC AGCAAGAGCA GCGGCGAGGG TGACAAGACA GTGTCCTGGA CCTGGAACCA GCCGGACGGC AATGGCCGCG CCGTCACGGG ATACCAGTAC AGCCTGGACG GCGGGGGATG GCAGGACACC AACCAGCGCT CCTTCTCCAA GACGGTCGGC TTCAGCGAAA CCCACACCCT TCGCGTTCGG GCCATCAGCG GCGGCCAGCC GGGCCGGATC GGCAGCGATA CGTCCCGCAG CGGCGCCGAG CCGCCGCCCC CGGTGCCCAC GTCGTGGAGC ATCACTGCCA CCCCGACCCG CAGCTGCACG GAGCCCCGCA AGGGTACGGA CAGCTTTGTG GCCGGCAACC CGTCGCAGTG CAACGGGGCA GGCAAATGGC TGGACGCCGG CGCTCCGGCG GACACGGACA GGTACCAGGT TTGGTACAAG ACGTCGAACA ACCCCACCGG CATCTGGTAC CACCTCACCA GCGGCATGGC CGCCGGAAAC TGGATCCGCT GCGATACGTC CAGCCGCGGC TGCAACCCGC CCGCCGGGAT GCCCAACCGC TAG
|
Protein sequence | MTSLLGKLGL KRRHKKIVTG TAFAAAVAVV ATGAVLYPGF KTTEVELNDG GVWVVSKSKN AVGRLNYPSR VLDGAVTPAS STFDILQDAG EVFVDDESGS TLNQVSPANM RLGGDKQLPG AADVSFGSSV LSVTDAASGK VWAVSPSTVN GFDEEASEPV LVGSEGTVSA VGADDRIYSA DPKAGTVTVT GVDANGVVVS SDAESWSELK GAGDLQITVV GDRPVVLDAA AGSLFLPGGK RLQLADARDA KLQQGGPGSD FVAIATQKAL LKQPLDGGTA KTVSFGGEGV PAAPVQLGGC VHAAWSGANK YVRDCVNDAD DKNVDVPKAS ASPSYVFRVN RDLVVLNDVN SGNVWLVNQN MQLVNNWDDV VPPKNESDEQ DQESADINTI NVLPDRTKPN RPPETKPDVV GVRPGRTTIL SVLDNDSDPD GDVLTAGLQG NPPKAGTLEN IYGGTAFQIS VPADARPGTE TFSYSASDGR GLSATGQVTL NVVGPDQNKP PQFKRGENTT MLVEQGKTVS QNILTDWVDP DGDDLVLLDA KADNEQDQVK VRRDGLLTFQ DSGATSGKKN VEVTIWDGRD TVTGKVVINV QPPGALAPVV NADHVTAVVG QDLVISPLKN DVDPNGGALR LAQVEANGPA DLGPVTDGGT FTFRSTTPGP VYLTYIASNG PQSSQGLIRV DVESGDDPGD PVAVHDVALM PTGGSVLLDP LANDSDPSGG VLVLQSVKLP ENATVSVSVI NHSVLRITDI LGTKDPILFE YTMSNGKKSA TGSVSVVPVP APAVVEAPQP KPDEVNVRVN DVVTIPVLDN DTHPQGQELT VDPVLPQGVD PVDGKSFVSE NTLRFIAGSQ PKTVRAIYNA VDPQGQKSAA AVTIHILPLE GAENSRPQPR NLTARVVAAG TVRIPVPLDG IDPDGDSVQL TGIDSTPAMG TATVGSNFID FTAAGDGAGT DTFRYKVVDR QGAVNTGTVT VGIAPRGEIN QKPTPVDDEV RVRPGRQIAV DATGNDTDPD GDQIRILTDG IEADPALQAT VSKTSGRVIL TAPNEAGTVN VRYTVADDRD ARAQATIRVV VDPEVPLKAP IARDDRVTSA QTMGKTAVDV PVLKNDEDPD GVGENLKIST EATTARPGAE GNMIVDLTEQ PQLIPYTVED VDGQKSTAII WVPGLGQQVP TLAKDEVLEV VAGQTVNVDL DEWVKVREGR SPRLTQADRI KLIGADGSDP VTGDGTGLKY TAGTDYVGPG SLTFEVTDGT GPDDPAGLKS TLSIRTKVLP DPNKNNPPEL LGANVEVPMG DSASIDLGRL TSDPDGDDVD NMKYELVGGG AAGFNASVDG RNLKVSAADS SRAGTAGAVQ VKARDPRGLE ATATFQLSVT ASNRPKPVAN DDVEPNAAAG KPVTINVLAN DANPFPETAL KIIAAATETG SGNVEVNGDS VTVTPAPGFT GTMVVTYTVE DKTQDASRHA TARVRLTVKD KPAAPTTPQA QSVGDQTALL TWAAPADRGS PITKYTVYGE GGFKQDCPAN TCTLNGLVNN TKYHFQVTAT NEFGDSDRSP ASAEVRPDVK PDTPLAPSLK FGDKELSINW TAPASKGSPV KSYDLEISPP PAGQNAQIQN LTAVSYVWKG LQNGVSYKVR VLARNDAKEP SEWSPYSAAE VPAGVPATPA APTAAQAASL GSQSQLKVSW AAPNNNGDAV SAYTLTTLRG GAVVATQQVA GTSQNVTVDN SEANYTFTVS ATNKAGTSAT SGQSAAIRAV GKPGIVGTPT ATLVDTGGNG GKIDVRFPVL SDAQRNGSTP QEITYRYSLT 
SGGGSGSIAA GGGTVAAANG TPTSVVVWAV SSRSSTAGDA SAPSNQVNPY GLAFAPNVNG SKSSGEGDKT VSWTWNQPDG NGRAVTGYQY SLDGGGWQDT NQRSFSKTVG FSETHTLRVR AISGGQPGRI GSDTSRSGAE PPPPVPTSWS ITATPTRSCT EPRKGTDSFV AGNPSQCNGA GKWLDAGAPA DTDRYQVWYK TSNNPTGIWY HLTSGMAAGN WIRCDTSSRG CNPPAGMPNR
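
The gene begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11 (the bacterial code listed above), GTG is an accepted start codon and is read as methionine when it initiates translation. The sketch below illustrates this on the first 30 bases of the gene sequence. It assumes the input starts at the annotated start codon and is in frame. The helper name `translate_cds` is hypothetical, and the codon table is built by hand rather than taken from a library.

```python
# Standard codon assignments in NCBI base order (T, C, A, G); translation
# table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues:
        # Assumption: the first codon is the annotated start codon, which is
        # read as methionine even when it is GTG or TTG.
        residues[0] = "M"
    # Drop a trailing stop codon if one is present.
    return "".join(residues).rstrip("*")


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGACAAGTC TGCTGGGGAA ACTTGGGCTC"))  # MTSLLGKLGL
```

The output matches the first block of the protein sequence.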