Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4525 |
Symbol | |
ID | 4597044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4785581 |
End bp | 4786810 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639779136 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_925709 |
Protein GI | 119718744 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.517886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGTTC AGGCTGTTCC GGTCACGTTG TCGACGGTCG TCGTCGCGGT CGATGTTGGC AAGACTGAGT TCGCGTTCTC GGTCACCGAT GCCACCCGGA CATTGCTCCT CAAGCCGCGG ACCGGCTGCC CGATGACCGG GCCGTCGCTG GCACAGGTCG TGGCCGACAT CGCCCGGTTG CTCCCGATCG ATGCGGTGGT CAAGGTCGGT ATCGAGGCGG CCGGGCATTA CCACCGGCCG TTGCTGATGA CAGGGGCGTG GCCGGGGACC TGGGAGGTGC TCGAGCTCAA TCCGGGTCAT GTCACCGAGC AGCGTCGGGT GCTGGGCAAA CGCACCATCA AGACCGACGT GATCGATCTT CAGGCGATGA CCGAGCTGCT GCTGGCCGGT CGCGGTCAGC CGGTCCGGGA CCGCTCGCTG GTGTTCGGGG AGTTGACGGC GTGGTCGGCG CATCGGGTCG GCCGGGTGGC GTTGCGGACA GCGACGAAGA ACCAGCTGCT CGGACATCTG GACCGGACCT TCCCCGGGCT GACCCTGGCG CTGCCGAATG TGCTGGCCAC CAAGGTCGGC CGGCTTGTCG CCACAGAGTT CCCCGATCCG GCACGGCTGG CCGCGTTGGG CAGCAGCCGG TTCATCCGCT TCGGAGCGAC CCGGGGCCTG CAGATCCGCC GGCCCGTGGC TGACAGGCTG GTCCAAGCCG CTCGCGATGC GTTGCCCACC ACCGATGCAG CTGTCGCCCG CGCCGTCCTT GCCGCTGACC TCGCGCTGCT CGACGACCTG GACGCGCAAG TCGACCAGGC CACCGAGCAG CTGGCTCGAC TGCTGCCCCG TAGCCCGTTC GCGCCGCTGC TGACGGTCCC AGGATGGGGC GCGGTCCGGG CCGGAAACTA CGGCGGCGCC CTGGGGGACC CGGCCCGATT CGACAACCAC CGGCAGATCT ACCGCACCGC CGGACTCAAC CCGATCCAGT ACGAGTCCGC CGGAAGACGT CGCGACAGTG TCATCAGCCG CGAGGGCAGT GTCGAGCTGC GTCGCGCGCT GATCGACCTG GGAGTGGGGT TGTGGCTCAG TGAGCCCGCC GCCAAGGTCT ACGGTGCGCA GCTTCGTGAC CGTGGCAAGA AGGGTCTGGT CATCGCCTGC GCGATGGCCA ACCGCGCGAA CCGGATCGCG TTCGCTCTGG TCCGCGACCA GAGCACTTAC GATCCCAGCA GGTGGATCCG GGAGGGCTGA
|
Protein sequence | MVVQAVPVTL STVVVAVDVG KTEFAFSVTD ATRTLLLKPR TGCPMTGPSL AQVVADIARL LPIDAVVKVG IEAAGHYHRP LLMTGAWPGT WEVLELNPGH VTEQRRVLGK RTIKTDVIDL QAMTELLLAG RGQPVRDRSL VFGELTAWSA HRVGRVALRT ATKNQLLGHL DRTFPGLTLA LPNVLATKVG RLVATEFPDP ARLAALGSSR FIRFGATRGL QIRRPVADRL VQAARDALPT TDAAVARAVL AADLALLDDL DAQVDQATEQ LARLLPRSPF APLLTVPGWG AVRAGNYGGA LGDPARFDNH RQIYRTAGLN PIQYESAGRR RDSVISREGS VELRRALIDL GVGLWLSEPA AKVYGAQLRD RGKKGLVIAC AMANRANRIA FALVRDQSTY DPSRWIREG
|
| |