Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0550 |
Symbol | |
ID | 4596217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 582461 |
End bp | 585370 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639775164 |
Product | TP901 family phage tail tape measure protein |
Protein accession | YP_921779 |
Protein GI | 119714814 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGTG CGGACATCAC CCGCACCAGC GGCACGGTGT CACGGCTCGG TCGCACAGCC AGCCGCACCG GCCGATTGCT CGGCGCGGCC TTCGCCGTGG GCAGCATCGC GCGCGCCGCG GTGGCCGTCG GCAAGGTCGG CGCTGGCTAC GTCGACTCGC TGAACAAGAT CCAGGCGTTG ACCGGCTCCA CGGACGCGTC TCTGAGCCGC ATTGCAAGGC GCCTGGAGTC GCAGGCCTCG GTGTACGCGA AGTTCGGCCA GACCACCGGC GACGCCGCGG CCGGCATGGT CGAGCTGACC AAGTCCGGGC TGTCGGCCCG GAAGGCCCTC GGCGCCATCC GGGGAACCAT GATTCTCGCC AAGGCCGGTG AGCTCGAGGT GGCCGACGCT TCCGAGCTCG TCGCCAACAC CTTGAACACC TTCTCCCTGA AGGCTTCGAA GGCCAGCAAG ATCGCCAACG GCCTGGCCAA CGCGGCGAAC ATCAGCTCGG CCAACGTCAC CGACCTGGCA GAGTCGTTCA AGTACGCGGC GCCCCTGGCG GCCAAGGCAG GCCTGCCCCT CGACCAGGTC AACGCGATCC TCGCCGAGCT GTCGAACTCC GGGATCAAGG CGTCCCAGGC CGGCACCAGC CTGCGTGGCA TCCTGCTGGC CCTCCAGGCT CCGTCCACGG CCGGCGCGAC CGCGCTGGAG GACCTCGGCG TGCGGGTGTA CGACGCGCGA GGTCGGATGC GCGACTTCGG TGACCTGCTC GAGGACCTGC GCAAGGGCCT CGGTCGGCTC AGTGACGAGG GCCGGAACTC TGCGCTGAAG TCGATCTTCG GCCGCAACGC GATCACCGGC GCCCAGGTGC TGCTCAAGGG CGGTCGGCAG GCGCTCGACG AGTACACCGC CGGGGTCCGT AGGGCGGGTG CGGCTCAGGC GCTCGCGGAG TCGTCCTCGC GCGGCCTGGC CGGCACGATC AACCAGATCA AGGCCGCCGC CACGTCGACG GCACAGGCGC TCTACCGCAT CTACTCCCCG GTCATCGACC GGGCGCTGCG CACAGCCCTG GAGTTCGCGA CCAAGCACAA GAAGGTCCTC ATCCCTGCGT TGGCCGGTGC CGCCGCGGCC GCGGCCGGTG CCGCCGCCGC CCTCGGAGCC ATCGCCCTCG TGACCAGCCC GACCGCCCTC GCAGTCAGTG CCGTGGTCGG CCTGGGAGCA GCGGTCTCGG TGGCCTACAA GCGCAGCGAG ACGTTCCGCA ACGCCGTCGC GGCGACCGCG GACGCGCTCC GGAGCTTCGG CGGCTACCTC CAGGGCACCG TGCTGCCGGC CGTCACCCGT ACCGCCCGCG AGATCGCCCA GCGGCTCCAG CCTGTGCTCG CCCAGGCCGG CCAGACCTTC CGCAGCGACA TCGCCCCCGC GCTCGCCCAG GTCGCGGCGC AGTTCCGAGA GTGGCAGCCG ACGATCCGCC GCGTTGCCGA GACCGCCGGG CGGCTCACCG CTCAGGTGCT GATCTTCCAG GCCAAGCTCA GCGGCAAGGT GATCCCGGTC CTGATCCGGC TCGGCGGGAT CATGGCGCGC ACCAACATCC CCGTGGCCCT GCGGCTCGCC GACGCGCTCG TCAAGCTGAC CGCCGCGAAC ATGCGAACCG GAGCCGCGAT GCTCGCGGCC GGGCAGAAGG CGGCCCGGTT CGCTGCTGCT GTCCGCTCCA AGTTCAACGA TGCGATCGCG TTCATGACGA CCGTCCCCGG TCGCATCAAG GGCGCCCTGG GCGACCTGAG CAACATCCTG TACGCCGCCG GCCGCTCGGT CATCTCTGGA CTGATCTCCG GCATCGTCGA CAAGGCCCAG GACCTGTGGA ACACGCTCGC CGGGATCACG TCGAAGATCC CGCTCCACAA GGGCCCACCG GCTCGGGACC GCAAGCTGCT GCGCGGCACC GGCCGGCTGA TCATGGCGGG GCTGATCCGG GGTATCGACG ACGGCTCGGA GGGCATCAAG CGGGCGCTCG AGCGGATCAC GCAACTGATC CAGAAGAGGC TCGACGGGAA GAAGCAGGCC GACCGGCGCA AGTCGCTGCT CAAGAGCCTC AAGGACGAGG GCGCCGCGCT GCGGGCGAAC GGGAAGCTGC AGGACGCCGT GGCCCGGGCG CTGGAGAAGG CCACGAGCCG GTACAAGGAC CTGATCCGGA CCTCGCGGGA GTACGCCGCC AGCGTCAAGG CCGGATTCCA GTCCTACGGC AGCGTGGTCG GCCTGGGCAC CACCGGGGGC GGCACCGCGG TGACCCTGCC GGCGTTGCTC TCCCAGCTGG CCGCCCGTGC GAGCGTGGCC GACCAGTTCA CCGCGATCAT CGAGAAGCTG AAGAGCCGGC TGAACAAGAC CTCGCTCAAG CAGCTGCTCG ACCAGGCCGC CCAGGGCGAC CTCGAGGGCG CGCTGGCGAC CGCGCAGGCG ATCGCGTCGG GCGGCCCGGC CGCGGTGGCC CAGATCAACG CACTGACCGC GCAGATCGGC AAGGCCGGCG GCAAGCTCGG CGACTCGGCG GCCGCGTCCC TGTTCGCCGC CGGCATCCGC GCGGCCGAGG GCTTCGCGAA GGGCCTCAAG CGACAGGAGC GGCGGCTCGA CCAGACCGCC GACCGGATGG CCGACCGGCT GGTCGACAGG ATCCGCACGC AGCTCGGCAT CGGGAAGACC TCGGCGCCGC GCACGCCCGC GCCCCGGTTC AGCGCGTCCG ACACCTACGC CGGCCGGACC CCGGCGTACG ACGCCTACAT GCGCCAGGCC GCCCGGCCGA GCTCCGGCGG TGACGTGCAC ATCACGCTGA ACGTCCAGGC GCCGGTCGGC TCCAGTAGCC AGGACATCGG TCGGACCCTG ACCAAGCACC TCGACGCCTA CTTCGGTGCT GGCGGACGCC GGCACACACG CTGGGCCTGA
|
Protein sequence | MARADITRTS GTVSRLGRTA SRTGRLLGAA FAVGSIARAA VAVGKVGAGY VDSLNKIQAL TGSTDASLSR IARRLESQAS VYAKFGQTTG DAAAGMVELT KSGLSARKAL GAIRGTMILA KAGELEVADA SELVANTLNT FSLKASKASK IANGLANAAN ISSANVTDLA ESFKYAAPLA AKAGLPLDQV NAILAELSNS GIKASQAGTS LRGILLALQA PSTAGATALE DLGVRVYDAR GRMRDFGDLL EDLRKGLGRL SDEGRNSALK SIFGRNAITG AQVLLKGGRQ ALDEYTAGVR RAGAAQALAE SSSRGLAGTI NQIKAAATST AQALYRIYSP VIDRALRTAL EFATKHKKVL IPALAGAAAA AAGAAAALGA IALVTSPTAL AVSAVVGLGA AVSVAYKRSE TFRNAVAATA DALRSFGGYL QGTVLPAVTR TAREIAQRLQ PVLAQAGQTF RSDIAPALAQ VAAQFREWQP TIRRVAETAG RLTAQVLIFQ AKLSGKVIPV LIRLGGIMAR TNIPVALRLA DALVKLTAAN MRTGAAMLAA GQKAARFAAA VRSKFNDAIA FMTTVPGRIK GALGDLSNIL YAAGRSVISG LISGIVDKAQ DLWNTLAGIT SKIPLHKGPP ARDRKLLRGT GRLIMAGLIR GIDDGSEGIK RALERITQLI QKRLDGKKQA DRRKSLLKSL KDEGAALRAN GKLQDAVARA LEKATSRYKD LIRTSREYAA SVKAGFQSYG SVVGLGTTGG GTAVTLPALL SQLAARASVA DQFTAIIEKL KSRLNKTSLK QLLDQAAQGD LEGALATAQA IASGGPAAVA QINALTAQIG KAGGKLGDSA AASLFAAGIR AAEGFAKGLK RQERRLDQTA DRMADRLVDR IRTQLGIGKT SAPRTPAPRF SASDTYAGRT PAYDAYMRQA ARPSSGGDVH ITLNVQAPVG SSSQDIGRTL TKHLDAYFGA GGRRHTRWA
|
| |