Gene Information |
Locus tag | Noca_3573 |
Symbol | |
ID | 4599452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3790417 |
End bp | 3792108 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639778181 |
Product | hypothetical protein |
Protein accession | YP_924760 |
Protein GI | 119717795 |
COG category | |
COG ID | |
TIGRFAM ID | |
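
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a stop codon that is not counted in the protein length; the record itself does not state these conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3790417, 3792108)
print(length, protein_length(length))  # 1692 563
```

Under these assumptions the result matches the Gene Length (1692 bp) and Protein Length (563 aa) fields of this record.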
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGGT GCAAGTTCCA GTCCTGGACT ACCGCATGCT CATCCCTAGA TGCGCGCAGT TCGCAGCTGC GTCATTCCGC CTGTGTCCGC GCGACTGCGC GGACCAACCG CAGCCGGCAC GCCGCCGACC AGCGCCCCCC GGCTCGAACT CCACGGCTCG CCCGGCCCCT GGCCGGACAC GCGAACGCGA CGCGTCGCAG CAACGGGTTC CAGCCATCGT TTCCACAAGC ACGCCACCGA CTTGTTTTCG AAGCTTTGAT CGAATATAGT ACAGACATGT CCACTGACAT CAGCGACCTC GACGCGGCCG GCGTGCTGGC CGCTAGCGGT GGTGCGCTGC GCCGGCGGCG TGACGCGCAG GTGGCGGATC TGCTGCTGGT GCTGCAATGG GCGGACCTCC ACGGCGACGA CCCGCAGGAC CGGCCGGGCG CGGTGCCGGT GAGTCACGGC GGCGAGCGGC TGGTCGAGCT CGGCGGCGAC GGCACTCCCC TGGTTCAAGA CCTGTGCGTG CACGAGCTCG CGATCGCTCG GCACACCCAT CCCGTCTCGA CCCGGGCAAC GATGGGCGAT GCACTCGACC TGCGGCACCG ACTGCCGCTG ACCTGGGCCC AGGTGGTCGC CCTGGAGTGC GAGACCTGGG TCGCCTGCAA GGTCGCGCGC ATGTCGCGCC GCCTCGACCG CGACCAGGTC GGGATCGTCG ACCGAGCGGT GGCCGAGGCG ATCGGCGGTG AGTCGCCCGG CCGCGTGCTG GCGATCGCCG AGGCCAAGGT CATCGAGGCC GACACCGCCG CCCACGCGGC CCGGTTCGAC GACGCTCGGC GCCGCCGAGG AGTCTGGGTG AGCCGCACCG ACCTCGAGAC CGGCAGCCGC ACCGTCTTCT CCCGCATCGA GCCTGGCGAC GCCGCATGGA TCGACGCGAT GACCGAGCGG ATCGCCGACG CCCTCCTCGC CCGCCCCGAC CTGCTCCGCG CCCACCACGC CGACCTGCCG GCCGCCCTGG ACGACCTCAG CCACGACGAG CTGCGCGCGA TCGCCTTCGG CTGGCTCGCG CGCCCCGACG ACGTCTGCGA GCTGCTGCGC ATCGGCGTCG ACGAGGTCGT CCCGCGCAAG GCGCACCACC GAGCCGTGCT CTACCTCCAC CTCCACGAGA GCGCCCTGGC CGGCAGCTCC ACCGCCGTCG CCCGGGTCGA GGGACTCGGA CCCATGCTCC TCGAACAGGT CGTGTCGCTT CTCGGCCACG CCCACGTGAC CGTCAAGCCG GTGATCGACC TCGACGACCA CGTCAGCGTC AATGCCTACG AGTTCCCCAC CGCCGTGACC GAACGCACCC GGCTGCGCTG CGTCGGCGAC GTCTTCCCGT ACGCCGTCGG AACCGCCACG ATGAGCGGCC GCTACGACGA CGACCACGCC GTCCCCTACG ACCCGCTCGG TCCACCCGGG CAGACCGGCG ACCACAACGA CGCCCCACTC GCCCGATCAC ACCACCGGGC CAAGACCCAC CGCGGCTACC GCGTCAGACA GATCGGCGAC GGCGCCTATC TCTGGACCAC ACCTCACGGG CTGCGGCGAC TCGTCGACGT CGGCGGCACC CACGAGCTCT GCGCGAGCGA CGCCTGGCGG CTCGAGCACG CCGAGGAGCT CGACGCCGCC ATCCAGCGCA TGTCCGACGC CATGCAGGCG GCTGCTTCCT GA
|
Protein sequence | MGRCKFQSWT TACSSLDARS SQLRHSACVR ATARTNRSRH AADQRPPART PRLARPLAGH ANATRRSNGF QPSFPQARHR LVFEALIEYS TDMSTDISDL DAAGVLAASG GALRRRRDAQ VADLLLVLQW ADLHGDDPQD RPGAVPVSHG GERLVELGGD GTPLVQDLCV HELAIARHTH PVSTRATMGD ALDLRHRLPL TWAQVVALEC ETWVACKVAR MSRRLDRDQV GIVDRAVAEA IGGESPGRVL AIAEAKVIEA DTAAHAARFD DARRRRGVWV SRTDLETGSR TVFSRIEPGD AAWIDAMTER IADALLARPD LLRAHHADLP AALDDLSHDE LRAIAFGWLA RPDDVCELLR IGVDEVVPRK AHHRAVLYLH LHESALAGSS TAVARVEGLG PMLLEQVVSL LGHAHVTVKP VIDLDDHVSV NAYEFPTAVT ERTRLRCVGD VFPYAVGTAT MSGRYDDDHA VPYDPLGPPG QTGDHNDAPL ARSHHRAKTH RGYRVRQIGD GAYLWTTPHG LRRLVDVGGT HELCASDAWR LEHAEELDAA IQRMSDAMQA AAS
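
The GC content and the protein sequence can both be recomputed from the gene sequence as displayed, in space-separated blocks of ten bases. The following is a minimal sketch in plain Python, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons, and this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGGGTCGGT GCAAGTTCCA"))  # MGRCKF
```

Passing the full gene sequence to `gc_content` and `translate` gives values to compare against the GC content field and the protein sequence listed in this record.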