Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4786 |
Symbol | |
ID | 4595391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | + |
Start bp | 104853 |
End bp | 106181 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639772573 |
Product | IS4 family transposase |
Protein accession | YP_919233 |
Protein GI | 119714091 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.0321655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.753556 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATGGCGC TGGCCGAGCA GGCCGGCCTC TCCGAGCTGG TCGCCGACCG GGTCACCTTG GCCTCGCTCT CGCCGCGGGT GGCCTCGGCG GGGGTGAACC CGGCCGGGAA GGTGACCTCG ATCATCGCGG GGATGGCCGC GGGCGCGGAC AACATCGACG AGTTGCAGGT GATCCGCTCC GGCGGGATGA AGCGACTCTT CGACCAGGTG TACGCGCCGG CCACGCTGGG CCAGTTCCTG CGCGAGTTCA CCCACGGCCA CACCTTGCAG CTGGCCTCGG TCGCCCGGGC CCACCTGGTC CACCTGGCGG CCCGGACCAA CCTGCTGCCC GGCATCGAGT CCCAGGCCTA CGTCGACATC GACTCGCTGC TGCGCCCGGT CTACGGGCAC GCCAAACAGG GCGCCAGCTT CGGGCACACC AAGATCGCCG GCAAGCAGGT GCTCCGTAAG GGTCTCTCGC CGCTGGCGAC CACGATCAGC ACCGCCCAAG GGGCTCCGGT GCTGGCCGGG ATCCGGCTGC GTGGCGGGAA GACCGGCTCT GGCAAGGGCG CGGCCTCGAT GGTCCGCGAG GCGATCAAGA CTGCTCGCGA CTGCGGTGCC ACCGGCGAGA TCCTGGTGCG TGGTGACTCC GCCTACGGCA ACAGCGCAGT CGTGGCCGCG TGCCTGAAGG CAGGCGTCCG GTTCTCGCTC GTGCTCACCA AGAACCCGGC GGTGTCCGCC GCAATCGGCT CCATCCCCGA GGACGCCTGG ACCCCGGTCA CCTACCCCGG AGCCGTGATC GATCCGGACA CCGGGGAGCT GATCAGCGAC GCGCAGGTCG CCGAGGTCGA GTTCACCGCG TTCGCCTCCA CCGAGCACCC GGTCACCGCC AGGCTGGTGG TGCGGCGGGT CCGCGACCGC GCCAAGCTCG ACGAGCTGTT CCCGGTCTGG CGATACCACC CGTTCCTCAC CAACAGCACG CAGCCGACCG TGCAGGCCGA CCTGATCCAC CGGCGGCACG CGATCATCGA GACCGTCTTC GCCGACCTGA TCGACGGGCC CCTGGCGCAC ATGCCCTCGG GACGGTTCGC GTCCAACAGT GCATGGGCGA TCTGCGCGAT GATCACCCAC AACCTGCTCC GCGCCGCCGA CACCCTCGAC CCCCACGCCG CTGCACCCGC GCGAGGCGCG ACGCTGCGCC GCCAGATCAT CCACGTCCCA GCCCGGCTCG CCCGCCCGCA ACGCCGCCAT GTGCTGCACC TGCCCGCGCA CTGGCCCTGG GCGAACCGCT GGCTGCGGAT CTGGACCGGC GTGTTCAGCC CCGCCCAAGC GCCACCACGC GCGGCCTGA
|
Protein sequence | MMALAEQAGL SELVADRVTL ASLSPRVASA GVNPAGKVTS IIAGMAAGAD NIDELQVIRS GGMKRLFDQV YAPATLGQFL REFTHGHTLQ LASVARAHLV HLAARTNLLP GIESQAYVDI DSLLRPVYGH AKQGASFGHT KIAGKQVLRK GLSPLATTIS TAQGAPVLAG IRLRGGKTGS GKGAASMVRE AIKTARDCGA TGEILVRGDS AYGNSAVVAA CLKAGVRFSL VLTKNPAVSA AIGSIPEDAW TPVTYPGAVI DPDTGELISD AQVAEVEFTA FASTEHPVTA RLVVRRVRDR AKLDELFPVW RYHPFLTNST QPTVQADLIH RRHAIIETVF ADLIDGPLAH MPSGRFASNS AWAICAMITH NLLRAADTLD PHAAAPARGA TLRRQIIHVP ARLARPQRRH VLHLPAHWPW ANRWLRIWTG VFSPAQAPPR AA
|
| |