Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3074 |
Symbol | |
ID | 4600191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3274022 |
End bp | 3275296 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639777680 |
Product | hypothetical protein |
Protein accession | YP_924263 |
Protein GI | 119717298 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.144067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGAGG CGCTGGCCGG GCTGACGGTC CGTGGCCGGG CGTTCCTGGC GGCCGGCATC ACCGCCGTCG TGTGCGCGAT CGCCCTCGAC CAGACCGCCC TGACCCGGAT CGGCGTGCTC GTCCTGGTCC TCCCGCTGCT CACGGCATGG GTGGTGGGGC GCAACCGGTA CCGGCTGGCC CTGGTCCGCA CGGTCACTCC ACAGCTGGTC GCCGCGGGCC AGCCGGCCCG GGTGTCGCTG GCCCTGAGCA ACGAGGGCCG TACGCCGAAC GGCGTGCTGC TGCTGGAGGA CCAGGTGCCC TACGTGCTCG GCACCCGCCC GCGCTTCGTC GTCGAGGGCA TCGGCTCCGG CTGGCGGCAG ACCGTCAGCT ACCAGGTCCG CTCCGACGTG CGCGGCCGCT TCGAGATCGG GCCGATGTCG GTGCGGGTCA CCGACCCCTT CGGGCTGGTC GAGCTGGGCC GCACGTTCCG CACCACCGTG CCGCTGACCG TGACGCCACG CACGGTGCCA CTCCCCCAGA TTCCGCTCGG TGGCGCCTGG ACCGGGTCCG GTGACAACCG GCCCCGCGCC TTCGCCACCG GCAGCGCCGA GGACGTCACT GTCCGGGAGT ACCGCCAGGG CGACGACCTG CGCCGGGTGC ACTGGCGCAG CTCGGCCCGG CTCGGCGAGC TGATGGTGCG GCGCGAGGAG CAGCCGTGGC AGTCCCGGGC GACGCTGGTC CTCGACAACC GGGTGCTGGC CCACCGGGGA CAGGGCATCG CATCCTCGCT GGAGGCGGCC GTCTCCGCCG CCGCGTCGAT CGCGGTGCAC CTGAGCCACC GCGGCTTCGC CGTACGCCTG GTCACCGCGC TGGGCGAGGA CCCCAGCAGC GCCTGGCACC TGCGCGACGC CGACCTCAAC ACCGGGCCGC TGATGGAGGC CCTCGCGGTG GTGCAGGCGA CCCACCAGTC CCGCCTCGAC ACTGCGTGGC TGGCCGAGGG TGCCCACGGC GGGCTGACGG TCGCCGTCTT CGGTGGCATC CTGCCCGCCG ACCTCCCGGT GCTGCGCCGG ATGCAGCACC AGGCCGGGTC GGCACTCGCG ATCGCGCTCG ACGTCGACGC CTGGACCGGC GCGCCGGCCG GCGTGGGCGC AACCCCGGCC CTCGGCCAGC AGGGCTGGCG AGCGGTGCCG CTCGGCCCGC GCGACCGGCT CGAGTCCGTG TGGCAGGAGC TCGGGCACAC GAGCGCGCAG CGCTCCCGCG TCGTCAGCCG GAGTGCCTCG GAGGCGACCG TGTGA
|
Protein sequence | MREALAGLTV RGRAFLAAGI TAVVCAIALD QTALTRIGVL VLVLPLLTAW VVGRNRYRLA LVRTVTPQLV AAGQPARVSL ALSNEGRTPN GVLLLEDQVP YVLGTRPRFV VEGIGSGWRQ TVSYQVRSDV RGRFEIGPMS VRVTDPFGLV ELGRTFRTTV PLTVTPRTVP LPQIPLGGAW TGSGDNRPRA FATGSAEDVT VREYRQGDDL RRVHWRSSAR LGELMVRREE QPWQSRATLV LDNRVLAHRG QGIASSLEAA VSAAASIAVH LSHRGFAVRL VTALGEDPSS AWHLRDADLN TGPLMEALAV VQATHQSRLD TAWLAEGAHG GLTVAVFGGI LPADLPVLRR MQHQAGSALA IALDVDAWTG APAGVGATPA LGQQGWRAVP LGPRDRLESV WQELGHTSAQ RSRVVSRSAS EATV
|
| |