Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4679 |
Symbol | |
ID | 4598223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4959854 |
End bp | 4963957 |
Gene Length | 4104 bp |
Protein Length | 1367 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639779288 |
Product | PKD domain-containing protein |
Protein accession | YP_925861 |
Protein GI | 119718896 |
COG category | [S] Function unknown |
COG ID | [COG4412] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03296] M6 family metalloprotease domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCACCGC GTCGCCTCCG GCGACCCCTC GAGATTGCGT CCTCCATGAC CAACCCTCCA TCGAGCACCT CGTCGAGCAT CCCGGCGCGC ACGCTGCGCC CCTGGCGGAC CGCAGCGGTC ACCACCGCGC TGGCGGTCGC CCTCACCACC GGCCTGTCGG CTGTCCTCGC CCCGGTCGGG GCCGCCGCAC CCGCCAGCAC CCAGCCCCGG CCGATCGGCG CCCACGGCCT GTCCGTCTCC TTCGACCGCC AGTCGGCCAC CATCGAGCGC GGCGAGCGGT TCCGGGTGTC CGGCACCGTC TCGACGCTGA GCCAGACCTC GCTCACCGCG GTGCCCCGGA CCGGTGAAGG GGTGCCGGCG AGCTTCACCC TGACCGTCAC CGACCCCGCC GGCAGGGTGC TCGGCACCCA GGCCGTGACC GCCGCCGACG ACGGCAGCTT CGCCACCATG GTGCCCGGCG CCATCACCGA CGGCCTCGCC GACGCGGACC TGCTGCGACT GGGCCTGCGC GCCGTGGACG CGACGTACGA CGACCACCAG GCCGCCGACG CCGGCGCCGG ATCGGTGGCG GTCCGCGCGG CCGCGACCGG GCTCCAGGTC GAGAACAGCT TCGTCTCGGC CGTCGGCTGG GTGAAGCCGG GCGAGGCCTA CCCCTCGCGG ATCATCGTCC AGAACCCCAC CGCCACCCCG GTGGCCGGCG CCTCGGTCAC GATCACGGCA CCCGCAGGCA CGAGCTTCAC GAACGCCAGC GGCCCCGGCA CCCACCCGTT CAGCAGCGAC ACGGTGGCCT GGACGATCCC GTCGGTGCCG GCGGCGACCG GCACGGTGCC CGGCACCGTC ACCCTGGTCC TGGAGAGCAA GGCCGACACC GTCGCCCAGG ACCCGACGGT GGTCTGGCGC GACCTGTCCA CCACCGCTCG CCTCACCGCG AGCACCGGGG CCCAGACCGT GGTCAGCCAC GGCCCCAAGG TGATCCCGCC GAGCGACGCC TTCGACACCG CCCGGTACGG CGACCGCCCG TTCCCGGTCA TCCCGGTGGA GTACCGCGAC CGCGCGTACA CCGCCGCGCA CACCGGTGAG CAGCTCGAGG AGGTCATCAA CTCCCCCGCC AAGCCCGGCT CCACCTTCAA CCTGTACCAG GAGATGTCCC TGGGCCAGCT CTTCCCCAAC GGCAGCGTCC CCTCCGCGGG GATCGCGACG GCCGACTTCA CCAGCTACGC CCCCGGCTTC CCGTTCACCC AGATCGACCC GACGGCCGTC AACACCTGCG CCGGCGTCAC GCAGACCGAC ACCCCGGCCG GCGCGATCCC CGCGCCCGGC ACGGTCGGCG GCCCGGCGTA CACGGAGCGG ATCACCAACG GCGTCTACAA CCTGCCCGGC ACCACCGGCT ACTACGGCTC CGACGGCGCG GGCTCCGCGG TGATCGGCTC CCTCACCGGC ATCTCGGCGC TCGCCCAGAT CGACAGCGGC TGCGGGCCGA CCAGCAAGCT CGTCGTGGAC GCGGCCGCGC TCGCCGACCC CGAGATCGAC TACTCGGACT ACGACACCGA CAAGGACGGC GTGGTCGACT TCTTCATGGC CGTGTTCGCC GGCTGCGGCG GCAACGGCGC CTCCCAGCTC GGGCTGTGCA GCGACGACCC CCAGGACGCG CTGCCCTACG ACAACGTCTG GCCGCACAGC TCGTCGCTGG AGTACTACTA CAACGACGCG AAGACGGGCC TGCCCGGCTT CACCACCGAC GACCAGCTCA AGGACCTCGA GGGCCGGCCG CTCTGGTACA CCGACAAGAC CTACAAGGAC ATGACCACGA CCGACAAGGG CGACGCCCTC AAGGTCTTCG TCCGGGTCGG CCCCTACAAC CTCAACCCCG AGACGGCCAT CGACAAGGCG AGCGTGATCT CGCACGAGTA CGGCCACTCG CTCGGGCTGC CGGACTTCTA CTCCGTCGGC GGCCGCGAGA CCTACGGCGA CTGGAACCTG ATGGCGACCG ACAAGTCGCA GAACATGGAC GCCTTCTCCC GCCAGGAGCT CGGCTGGGTC GTCCCGCAGG TGCTCCGGGC AGGCGAGACC CGGACCGTCG ACGGCTGGAC CGACTCCAAG CAGGACACCG GCACGATCAC GTGGCAGCGC CCCGACGGGA CGCCGTACAC GCTCACCAAC GGGCCGGACG GCGTGGTCCA CAACTCGCTG ATGTACGTCG CGCGGCTGCC CGGTCGTCAG CTGATCGACC CGGCCAAGTT CGACACGGGT GACAAGGCGA CCAAGACCCA CGCGTGGTTC TCCGGCGCCG GCAACGACTT CGGCTGCGCC AACAACGGCG GCGGCCACAA CCTCGACATC GCGATCCCGG GGATCAAGGA CCTGCCCGCC GGCTCGACGG TCCGGCTGGA CCTCAAGTCG CTGTTCGACA TCGAGTGGGA CTTCGACTAC GGCTTCGTGC TGACCTCCAA GGACGGCGGG AAGAGCTTCA CCTCCCACGA GTCGCTGCGC GACACCCCGA CCACGACGCC GATGACCTCG AACCCGAACC AGAGCTCCTG CCAGTCGGCG TACGGCAACG GCATCACCGG ATCGAGCGGG TCCTACTCCG ACCCGGTCAC GGTCCAGCTC GACCGCACCA CCGGCAACTA CCCCGACTCG CAGTTCGTGG CCGACAGCTT CGACATCTCC GACCTCGTCG GGGCGGCCAC CCCGGTGCTG CGGTTCAGCT ACGCGACCGA CCCGGGGCTG GCCCGACCCG GCTGGTTCAT CGACGACCTC AAGGTCACCG CGACCACCCC GAGCGGCGAG AAGGTGCTGC TGCAGACCGA CCTGGAGAAG GACGGTGGTC CGAGCGACCC GCGGATCTTC AACGGCGGCT GCCAGGCCGA CAACCCGGGC AGCGACTGCA CCAAGGGCTG GCAGTTCGTG ACCGCCGGTG ACGAGGCGGC CTTCGACCAC GGCTACTACC TGGAGATGCG GGACCGCTCC GGCTTCGACC TCGACGGGCA CGGACAGATC GACCGCGACC CGATCGGCTT CGAGCCCGGG CTCTACCTCG CCTACACCGA CGAGGCACAC GGGTACGGCA ACGCGGGGAC CGACTCCCCG CCGGCCCAGT CGCCCCTGGA CTCGACCCCC GAGCCCGGCA ACGACACGCC GAACCTGAAC GACGCGGCGT TCACGGCGGC CGCGACCCGC TCGACGTACA CCGACTCGGG CCCCGGGCAC ACCGACAACT ACACCGACCC GTCGAACACC ACGGTCGACA GCCGGTACGC CGACGTCGCG AACCCGTGGC GGTTCCAGTA CGGCTGCCTG GGCTTCCGGG TGCTGTCCAT GGCCGGAAAC ACCAACGGCC CGGCGACGTC CGACGGCGAC CTGACCGGAT CGGTGCGGTT CACCATGGGC ACCGGCTGCG GTGACTTCGA CTACGGCTAC AGCCCGACGC CGGCGCCGGC CAACACCAAG CCGTCCGCGC GGGCCACGGC CTCGGCGACG ACGGTGAGGA CCGGCGACGC GGTGCGCTTC AGCGGATCGG ACAGCACCGA CGCCGAGACG CCGAACGACC TGGACTACAG CTGGGACTTC GGTGACGGCG GCTCGACCAA GGACGCCGCC GGCTCCTTCG CCCGGCACAC CTTCACCGAG CCCGGGACGT ACGCCGTCAC CCTCCTGGTC ACCGACCCCG AGGGCGCCAC CGACACCGAC ACGCTGACCG TCAAGGTCAC CGGCGACTCC ACAGGTCCCG GCCCCCAGCC GGGGGCCCGG ACGAGCGTCA ACTGCGGCTC CGCCAAGGTG ACCCGGCACG GCAGCTGGCG CGACGTGCGG CCGGAGCGCG GCGGCGGCTA CTGCGACAAC GCCGGCAGGG GCAACGGCCG CGACACGATG ACCCTGACCA CCAAGGGCCC GCGTGCGGAG ATCTTCTTCG GCCGCTCGGT CCACGGCGGG AAGGCGGCGC TGTTCGTCGA CGGCACGCAG GTCGGGACGA TCAGCTTCCG CAACAAGAAC AGCACCCCGG TCATCGCCTA CCGCAAGGTG CTGCGCGGCC TCGGCTCCGG CAAGCACCAG CTCCGCCTGG TGGTGCTCAC GGGCCGGGCC TACGTGGACC GGTTCCGCTT CTAG
|
Protein sequence | MAPRRLRRPL EIASSMTNPP SSTSSSIPAR TLRPWRTAAV TTALAVALTT GLSAVLAPVG AAAPASTQPR PIGAHGLSVS FDRQSATIER GERFRVSGTV STLSQTSLTA VPRTGEGVPA SFTLTVTDPA GRVLGTQAVT AADDGSFATM VPGAITDGLA DADLLRLGLR AVDATYDDHQ AADAGAGSVA VRAAATGLQV ENSFVSAVGW VKPGEAYPSR IIVQNPTATP VAGASVTITA PAGTSFTNAS GPGTHPFSSD TVAWTIPSVP AATGTVPGTV TLVLESKADT VAQDPTVVWR DLSTTARLTA STGAQTVVSH GPKVIPPSDA FDTARYGDRP FPVIPVEYRD RAYTAAHTGE QLEEVINSPA KPGSTFNLYQ EMSLGQLFPN GSVPSAGIAT ADFTSYAPGF PFTQIDPTAV NTCAGVTQTD TPAGAIPAPG TVGGPAYTER ITNGVYNLPG TTGYYGSDGA GSAVIGSLTG ISALAQIDSG CGPTSKLVVD AAALADPEID YSDYDTDKDG VVDFFMAVFA GCGGNGASQL GLCSDDPQDA LPYDNVWPHS SSLEYYYNDA KTGLPGFTTD DQLKDLEGRP LWYTDKTYKD MTTTDKGDAL KVFVRVGPYN LNPETAIDKA SVISHEYGHS LGLPDFYSVG GRETYGDWNL MATDKSQNMD AFSRQELGWV VPQVLRAGET RTVDGWTDSK QDTGTITWQR PDGTPYTLTN GPDGVVHNSL MYVARLPGRQ LIDPAKFDTG DKATKTHAWF SGAGNDFGCA NNGGGHNLDI AIPGIKDLPA GSTVRLDLKS LFDIEWDFDY GFVLTSKDGG KSFTSHESLR DTPTTTPMTS NPNQSSCQSA YGNGITGSSG SYSDPVTVQL DRTTGNYPDS QFVADSFDIS DLVGAATPVL RFSYATDPGL ARPGWFIDDL KVTATTPSGE KVLLQTDLEK DGGPSDPRIF NGGCQADNPG SDCTKGWQFV TAGDEAAFDH GYYLEMRDRS GFDLDGHGQI DRDPIGFEPG LYLAYTDEAH GYGNAGTDSP PAQSPLDSTP EPGNDTPNLN DAAFTAAATR STYTDSGPGH TDNYTDPSNT TVDSRYADVA NPWRFQYGCL GFRVLSMAGN TNGPATSDGD LTGSVRFTMG TGCGDFDYGY SPTPAPANTK PSARATASAT TVRTGDAVRF SGSDSTDAET PNDLDYSWDF GDGGSTKDAA GSFARHTFTE PGTYAVTLLV TDPEGATDTD TLTVKVTGDS TGPGPQPGAR TSVNCGSAKV TRHGSWRDVR PERGGGYCDN AGRGNGRDTM TLTTKGPRAE IFFGRSVHGG KAALFVDGTQ VGTISFRNKN STPVIAYRKV LRGLGSGKHQ LRLVVLTGRA YVDRFRF
|
| |