Gene Information |
Locus tag | Arth_1287 |
Symbol | |
ID | 4446211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1432911 |
End bp | 1438946 |
Gene Length | 6036 bp |
Protein Length | 2011 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689095 |
Product | fibronectin, type III domain-containing protein |
Protein accession | YP_830781 |
Protein GI | 116669848 |
COG category | |
COG ID | |
TIGRFAM ID | |
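The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1432911
END_BP = 1438946
GENE_LENGTH_BP = 6036
PROTEIN_LENGTH_AA = 2011


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of translated codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for the values in this record.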
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.111865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTGGTG CGATTATTTA TCCGGGGTTC AAGACCACTG AGGTGGAGTT GAACGACGGC GGTGTGTGGG TGGTCAGTAA GACGAAGAAT GCTGTTGGCC GGTTGAATTA TCCGTCGCGT GTCCTCGATG GTGCGGTGAC GCCGGCGAGT ACCACGTTCG ATATCCTGCA GAACTCCGGG AATGTCTTTG TTGACGATGA GACCGGCTCG ACTCTGAATC AGGTGTCGCC GGCGAATATG CAGCTGGGCG GGGATAAGCA GTTGCCGGGT TCGGCTGATG TCAGTTTCGG TTCCGCGGTG ATTTCGGTGA CGGATGCGGC CAAGGGCAAG GTGTGGGCGC TTTCGCCGTC CACGGTGAAC GGTTTTGACG AGGAATCCAC GGAACCGGTG CTGGCCGGTT CGGAGGGTTT GGTGTCCGCT GTCGGGACGG ATGACCGGAT TTACAGTGCG GATCCGAAGA CGGGTGTTGT GACTGTGACG GCTGTTGATG CCAATGGTGA GGTGGTGTCT TCGGAGTCGG GCACGTGGGA CGGGCTGAAG GGTGCCGGTG ATCTGCAGCT GGCGGTGGTG GGGGATAAGC CTGTGGTCCT GGATGCGGGG CGGGGGAAGT TGTTCCTGCC GGGCGGTCGT GGGCTGGATT TGGAGAACGC GCGGGATGCG AAGTTGCAGC AGTCCGGTCC GGCGTCGGAT GTTGTTGCGG TGGCGACGCA GAAGGCGTTG TTGAAGCAGC CGTTGGATGG TTCGGCGGCG AAGACGGTGT CCTTTGACGG TGAGGGTGTT CCGGCGGCTC CTGTGCAGTT GGGCGGGTGT GTTCATGCGG CGTGGTCGGG GGCGAATAAG TATGTCCGTG ATTGTGTGAA TGATGCTGAT GATAAGAACG TTGAGGTGCC GAAGGCGAGT GCATCGCCGT CGTATGTGTT CCGGGTGAAC CGGGACCTGG TGGTGTTGAA CGATGTGAAC TCGGGCAATG TGTGGCTGGT GAACCAGAAC ATGCAGCTGG TCAACAACTG GGACGACGTC GTCCCGCCCA AGAACCAGTC CAACGACCAG GACCAGGAGT CCGCGGACAA CAACACGATC AACATCCTGC CGGACCGCAC GAAGCCCAAC CGCCCGCCGG AAACCAAACC CGACACTGTC GGCGTGCGGC CCGGGCGCAC CACAATCCTC AGCGTGCTGG ATAACGACTC CGATCCCGAC GGCGACGTCC TGACGGCATC CGTGGTCGGT TCCGGGCCGG CGTCGGGCAC GCTGCAAAGC ATCTACGGCG GCACGGCTTT CCAGATCACG GTTCCGGCCG ACGCCAAGCC CGGCTCCGAA ACGTTCGGCT ACAACGCTGC GGACGGACGC GGGCTCTCCG CCGGCGGACA GGTTTCCCTG AACATCGTCG GCGCTGACGA GAACAAACCA CCCCTATTCA AGCGGGGCGA CCCCACCACC ATGCTGGTGG AGCAGGGCAA AACCGTCAGC CAGAACATCC TGACCGACTG GACCGACCCC GACGGCGATG ACCTCGTGCT GCTGGACGCC AAGGCGGACA ACGAGCAGGA CCAGGTCAAG GTCCGCCGCG ACGGGCTGCT CACCTACCAG GATTCCGGGG CCGCCTCCGG CAAGAAGACC GTCACGGTGT CCGTGTGGGA TGGCCGTGAC ACCACTACCG GGCAGGTGGT GGTGAACGTG CAGCCGCCGG GTGCCCTGGC GCCCGTGGTC AACGCGGACC ACGTCACCGC CGTCGTCGGC CAGGACCTGG TGATCGCACC GCTGAAGAAC GACGTCGACC CCAACGGCGG AGCCCTCCGG 
CTCGCCCAGG TGGAAGCGTC CGGCCCGGCC GAACTCGGGC CCGTGACCGA CGGCGGTACG TTCACGTTCC GCAGCGAGAC CGCCGGTCCC GTCTACCTCA CGTACATTGC CAGCAACGGT CCGCAGAGCA GCCAGGGACT CATCCGCGTG GACGTGGAAT CAGGCAAGGA CACCGGCGAC CCCGTCGCAG TCCATGACGT CGCCCTGATG CCCACGGGCG GCAGCGTGCT CCTCGACCCT CTGGCCAACG ACTCCGATCC CTCCGGAGGG GTGCTGGTGC TCCAGTCGGT CCAGCTTCCC GAGAACACCA CAGCCTCGGT CAGCGTGATC GACCACAGCG TGCTGCGAAT CACCGACGTG CTGGGCACCA AGGACCCCTT CCTGTTCCAG TACACGATGT CCAACGGAAG GAAGTCCGCC ACCGGCAGCG TCTCCGTAGT CCCGGTGCCG GCGCCCGCCG TCGTGGAAGC ACCCCAGCCG AAACCGGATG AGGTGAACGT CCGCGTCAAC GACGTCGTCA CCATTCCCGT GCTGGCCAAC GACACCCATC CGCAGGGACA AAAACTAACG GTGGATCCGG TGCTGCCGCA GGCAGTGGCG GAAGCGGACG GCAAGGGCTT CGTTTCCGAA AACACGCTCC GGTTCATCGC CGGGCCCCAG CCCAAGACTG TCCGTGCCAT CTACAACGCC GTGGACCCGC AGGGCCAGAA GAGCGCCGCG GCCGTCACGA TCCACATCCT GCCGCTGGAA GGCGCGGAGA ACTCCAGGCC GCAGCCAAAG AACCTGACGG CACGGGTGGT GGCCGCCGGG TCGGTCCGCA TTCCGGTGGA CCTGGACGGC ATCGATCCCG ACGGCGACTC CGTCCAGTTG ACCGGCATCG ACAGCACCCC CAATATGGGC ACCGCCACTG TCGGCAGCAA CTTCATCGAT TTCACGGCGG CGGGCGACGG CGCGGGCACT GACACATTCC GGTACAAAGT GGTGGACCGG CAAGGCGCCG TCAATACCGG CACGGTTACC GTAGGCATCG CGCCCCGCGG CGATGCGAAC CAGAAACCCA CTCCTGTTGA TGACGACGTC CAGGTCCGTC CGGGACGCCA GATTGCGGTG GACGCGATCG GGAACGACAC TGACCCCGAC GGCGACCCCA TCGGCGTCGT TGGGGACGGG ATCGAGGCGC CCGCGGAACT CCAGGCCACC GTGAGCAAGG CGAGCGGCCG CATCATCCTG CAGGCGCCGG CCAGCGAGGG TACGGTCAAC GTCCGCTACA CGGTCGTCGA TGACCGCGGC GCCTCTGCCC AGGCCGCCAT CCGGGTGAAC GTCCGCAATG ACGTGCCCTT GAAGGCACCC ATTGCCCGGG ACGACCGCGT GACCTCCGCC CAGACACTGG GCAAGACCGC CGTGGACGTT CCCGTGCTGA AGAACGACGA AGACCCGGAC GGCGTGGGCG AGAACCTGAA GATCGCCACG GACGCCACCA CTGCCCGGCC GGGGACCGAC GGCAACATGA TGGTGGAACT GACCGAGCAG CCCCAACTGA TCCCCTACAC GGTCGAGGAC GTGGACGGCC AGAAGTCCAC CGCCATCATC TGGGTGCCCG GCATCGGACA GCAGGTGCCG ACGCTTGCCA AGGACGATGT ACTCGAGGTG ATCGCCGGAC AATCCGTGAC GGTGGACCTG AAAGAATGGG TCAAGGTCCG TGACGGACGG TCGCCCCGCC TCACCCAGAC GGACCGGATC AAGCTGATCG GCGCCGACGG CGGTGACCCT GTGGCCGGAA ACGGCACAGC GATCAACTAC GCGGCCGGGC 
AGGACTATGT GGGCCCGGGG TCCATCAGCT TCGAAGTCAC CGATGGCAGC GGACCGGACG ATCCGGCCGG CCTGAAATCG ACCCTCAGCA TCCGGACAAA AGTCTTGCCG GACCCCAACC GCAACAACCC GCCCACGCTG CTGGGAAGCT CTGTGGACGT GCCCAAGGGC GAGTCCGCGG AGATCGACCT GGGCCGGCTG ACCTCGGACC CTGACCGCGA TGACGTGGAC AACATGAAGT ACGAGCTGGC GGGGGACGGC CCGGCCGGTT TCAACGCCCG CATCGACGGC AGGACCCTTA AAACCTCCGT GGACGGGGCC ATGGCCACCG GCACCTCGGG CGCCGTGCAG GTCAAAGCGA AGGATCCGCG GGGACTGGAA GCGACGGCGA CATTCCAGCT CGCTGTCACC GCCTCCAACC GGCCGAAACC GGTGGCTAGT GACGACGTCG AGCCCAACGC CGCCGCCGGC AAGACGGTGT CCGTAAATGT CCTCGCCAAC GACTCCAACC CGTTCCCCGA GACGCCGCTG AAGATTTTCT CGGCTGCAAC CGAGACAGGA AGCGGCAATG TGGAGGTCGC CGGCAGCAAC GTCAACGTCA CTCCTGCTTC CGGCTTCACC GGGACCCTGA TTGTGGTCTA CACGGTGGAG GACAAAACAG GGGAGACCTC CCGGCACGCC ACCGCCAGGG TCCGGCTGAC GGTCAAGGAC AAGCCCCTGG CACCGGCCAC GCCGCAGGCG CAAAGCGTGG GCGACCAGAC CGCCCTGCTG AACTGGACGG CACCGGCGGA CCGCGGCTCA CCTATCACCA AGTACACGGT GTACGGTGAG GGCGGCTTCC AGCAGGCCTG CCCGGCGAAC AGCTGCACGC TCACCGGACT GACCAACAAC ACGAAGTACC ACTTCCAGGT CACTGCCACG AACGAGTTCG GCGAATCCGA GCGCTCTCCG GCGTCCGCCG AGGTCCGCCC GGACGTCAAA CCCGACACCC CGGTTGCCCC GGCGCTCAAG TTCGGCGACA AACAGCTCTC CGTGACGTGG ACAGCCCCGG CCAGCAAGGG TTCACCGGTC AAGTCCTACG ACCTGGAGAT CTCCCCGGCC CCGGCCGGGC AGAACGCCCA GATCCAGAAC CTGACATCGC TCAGCTACGT CTGGAAGGGG CTGCAGAACG GGGTCGCCTA CAAGGTCCGG GTCCTGGCGC GCAATGATGC CAAGGAACCG TCCGAGTGGA GCGCCTACTC CGCTGCCGAA ACCCCTGCCG GTGTTCCGGT CACCCCGGCC GCGCCCAACG CGACGGCGGC GCAGTCTGTC GGAACGCAGA GCCAGCTCAG GGTGACCTGG ACGGCCCCGA ACAACAACGG CGACGCCATC TCCGCCTACA CGCTGACCAC CTTGAGGGGC GGTGCCGTCG TGACCACCCA GCAGGTGTCC GGGACGTCGC AGAACGTAAC GGTGGACAAC TCCGAGTCGG GGTACACCTT CACCGTGTCG GCCACCAACA AGGCGGGTAC CTCGGGCACG AGTGCGCAGT CCGCGGCCGT GCGGGCGGTG GGCAAGCCGG ACATGGTGGG CAAACCCACC GCGACCCTTG TCGACACCGG CGGCGACGGC GGGAAGATCG ACGTCAGGTT CCCTGTCCTG ACCGATGCAC AGCGGAACGG ATCGACTCCC GGCGAGATCA CCTACAAGTA CCGGCTGACA TCGGGCGGCG GCAGCGGCAA CATCGCCGCC GGCGGAGGGA TTGTCGCGGC AGCCAACGGC ACCGATACCG CCGTTGTGGT GTGGGCCGTC TCGTCCCGCA GCTCCACCGC 
GGGCGATGCA AGCCCTCCAT CAAACGTGGT GAACCCTTAC GGGCTGGCTT TCGCCCCTAC CGTGCAGGGA AGCGGCAGCG GCGGCGTTGG AGACAAAACC GTTTCCTGGA CCTGGAACCA GCCCAGCGGC AACGGCCGTG CGGTGACGGG CTACCAGTAC AGCCTCGACG GCGGAGGCTG GGTCAACACC GATCAGCGGT CCTTCTCCAA GAGCGTGGGC TTCAGCGAGA CCCACACCCT GCGGGTCCGC GCCATCAGTG CCAACCAGCC CGGGCGCATC GGCAGCGATA CCTCGCGGAG CGGAGCGGAA CCGCCGCCCC CGGCCCCGAC GTCGTGGAGC ATCACCGTTA CCCCCGTCCG AAGCTGTACC GAGCCGAACC GGACCACCGA TAGCTTCCGG CAGGGCAATC CCTCGAGCTG CGTCTCACCA GGCAAGTGGA TGGACGCCGG CGTCACTGCG CAGTCCGACT ACTACGTTGT CTGGACGAAG AGCAGTGACA ACCCCACCGG CATCTGGTAC CACCTGACGT CCGGTCCGGC CGCCGGCAAC TTCGTCCGCC ATGACACGAC GGACAGGGAA AATTCGGGAC CGCCGCCAGG CATGCCCCGG CGGTGA
|
Protein sequence | MAGAIIYPGF KTTEVELNDG GVWVVSKTKN AVGRLNYPSR VLDGAVTPAS TTFDILQNSG NVFVDDETGS TLNQVSPANM QLGGDKQLPG SADVSFGSAV ISVTDAAKGK VWALSPSTVN GFDEESTEPV LAGSEGLVSA VGTDDRIYSA DPKTGVVTVT AVDANGEVVS SESGTWDGLK GAGDLQLAVV GDKPVVLDAG RGKLFLPGGR GLDLENARDA KLQQSGPASD VVAVATQKAL LKQPLDGSAA KTVSFDGEGV PAAPVQLGGC VHAAWSGANK YVRDCVNDAD DKNVEVPKAS ASPSYVFRVN RDLVVLNDVN SGNVWLVNQN MQLVNNWDDV VPPKNQSNDQ DQESADNNTI NILPDRTKPN RPPETKPDTV GVRPGRTTIL SVLDNDSDPD GDVLTASVVG SGPASGTLQS IYGGTAFQIT VPADAKPGSE TFGYNAADGR GLSAGGQVSL NIVGADENKP PLFKRGDPTT MLVEQGKTVS QNILTDWTDP DGDDLVLLDA KADNEQDQVK VRRDGLLTYQ DSGAASGKKT VTVSVWDGRD TTTGQVVVNV QPPGALAPVV NADHVTAVVG QDLVIAPLKN DVDPNGGALR LAQVEASGPA ELGPVTDGGT FTFRSETAGP VYLTYIASNG PQSSQGLIRV DVESGKDTGD PVAVHDVALM PTGGSVLLDP LANDSDPSGG VLVLQSVQLP ENTTASVSVI DHSVLRITDV LGTKDPFLFQ YTMSNGRKSA TGSVSVVPVP APAVVEAPQP KPDEVNVRVN DVVTIPVLAN DTHPQGQKLT VDPVLPQAVA EADGKGFVSE NTLRFIAGPQ PKTVRAIYNA VDPQGQKSAA AVTIHILPLE GAENSRPQPK NLTARVVAAG SVRIPVDLDG IDPDGDSVQL TGIDSTPNMG TATVGSNFID FTAAGDGAGT DTFRYKVVDR QGAVNTGTVT VGIAPRGDAN QKPTPVDDDV QVRPGRQIAV DAIGNDTDPD GDPIGVVGDG IEAPAELQAT VSKASGRIIL QAPASEGTVN VRYTVVDDRG ASAQAAIRVN VRNDVPLKAP IARDDRVTSA QTLGKTAVDV PVLKNDEDPD GVGENLKIAT DATTARPGTD GNMMVELTEQ PQLIPYTVED VDGQKSTAII WVPGIGQQVP TLAKDDVLEV IAGQSVTVDL KEWVKVRDGR SPRLTQTDRI KLIGADGGDP VAGNGTAINY AAGQDYVGPG SISFEVTDGS GPDDPAGLKS TLSIRTKVLP DPNRNNPPTL LGSSVDVPKG ESAEIDLGRL TSDPDRDDVD NMKYELAGDG PAGFNARIDG RTLKTSVDGA MATGTSGAVQ VKAKDPRGLE ATATFQLAVT ASNRPKPVAS DDVEPNAAAG KTVSVNVLAN DSNPFPETPL KIFSAATETG SGNVEVAGSN VNVTPASGFT GTLIVVYTVE DKTGETSRHA TARVRLTVKD KPLAPATPQA QSVGDQTALL NWTAPADRGS PITKYTVYGE GGFQQACPAN SCTLTGLTNN TKYHFQVTAT NEFGESERSP ASAEVRPDVK PDTPVAPALK FGDKQLSVTW TAPASKGSPV KSYDLEISPA PAGQNAQIQN LTSLSYVWKG LQNGVAYKVR VLARNDAKEP SEWSAYSAAE TPAGVPVTPA APNATAAQSV GTQSQLRVTW TAPNNNGDAI SAYTLTTLRG GAVVTTQQVS GTSQNVTVDN SESGYTFTVS ATNKAGTSGT SAQSAAVRAV GKPDMVGKPT ATLVDTGGDG GKIDVRFPVL TDAQRNGSTP GEITYKYRLT SGGGSGNIAA GGGIVAAANG TDTAVVVWAV 
SSRSSTAGDA SPPSNVVNPY GLAFAPTVQG SGSGGVGDKT VSWTWNQPSG NGRAVTGYQY SLDGGGWVNT DQRSFSKSVG FSETHTLRVR AISANQPGRI GSDTSRSGAE PPPPAPTSWS ITVTPVRSCT EPNRTTDSFR QGNPSSCVSP GKWMDAGVTA QSDYYVVWTK SSDNPTGIWY HLTSGPAAGN FVRHDTTDRE NSGPPPGMPR R
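The gene sequence starts with GTG while the protein sequence starts with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below translates only the first 30 bases of the gene sequence as an illustration. The codon map is deliberately partial (just the codons in this fragment), the start-codon set lists only common initiators, and all names are hypothetical rather than part of any library.

```python
# Partial codon map: only the codons that occur in the fragment below.
CODON_TO_AA = {
    "GTG": "V", "GCT": "A", "GGT": "G", "GCG": "A", "ATT": "I",
    "TAT": "Y", "CCG": "P", "GGG": "G", "TTC": "F",
}

# Common bacterial initiation codons (not the complete table 11 list).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix, reading a recognized start codon as Met."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TO_AA[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is translated as methionine
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    print(translate_prefix("GTGGCTGGTG CGATTATTTA TCCGGGGTTC"))
```

The result can be compared with the first ten residues of the protein sequence above (MAGAIIYPGF).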