Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3974 |
Symbol | |
ID | 4598109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4193625 |
End bp | 4194737 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 639778579 |
Product | VWA containing CoxE family protein |
Protein accession | YP_925158 |
Protein GI | 119718193 |
COG category | [R] General function prediction only |
COG ID | [COG3552] Protein containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0702062 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACC GGCTGCCGGA CGAGATCCTC CTCGCCTTCG CCCGGGCGCT GCGGGCCGCA GGCGTCCCCG TGACCCACGA CCGCGCCCAG GGGTTCCTGG CGGCGGCCGC CGAGCTCGGC GTGGCCGACC AGCGCGCGAC CTACCTCGCC GGCCGCGCGA CGCTGTGCGC CGGGCCGGAC GACCTCGAGC GCTACGACCA GGTCTTCCAC GCGTTCTTCA ACGCCCGCGA CGGGCTGCCC CGGTCGCGTC CGGCCCACCC CGGGGCACCG TTCGCCGGGC TGCCGCTGGA GGAGTCCACC GGCACCGGCG AGGGCCAGGA CGAGGAGGTG CTGCGGGCCC TGGCCAGCGA GACCGAGGTG CTGCGCCAGC GCGACGTCGC CTCGCTCAGC GCGGCTGAGA AGCACCGGCT CGCGGGCATG TTCGCGACCC TGCGGCCGCG ACCACCGACG CGGCGTACCG CCCGCCACCA GGCGTGGCAC CGCGGCGAGG TCGACGCCTC GCGCACGCTG CGGGCCTCGC TGCGCCGGAT GGGCGAGCCG GCCGACATCG CCTGGCGGCG CCGCCGCCGG CGGCCGCGAC GGGTGGTGCT GCTCGTGGAC GTGTCCGGCT CGATGAGCGG GTACGCCGAC GCGCTGCTGC GCCTGGCGCA CCGGTTCACC CAGGTGGGGC GCGAGAGCGG CGGCGTGGTC GAGACGTTCA CCGTCGGGAC CCGGCTGACC CACCTGACGC GCGCGATGCG GGTCCGCGAC CCCGAGCGGG CGCTGGTCGC GGCCGGTGAG ACCGTGCCCG ACTGGTCGGG CGGCACCCGC CTGGGCGAGA CGTTGCGGAT CTTCCTGGAC CGGTGGGGCC AGCGCGGCCT GGCGCGCGGC GCGGTCGTCG TCGTGTTCAG CGACGGCTGG GAGCGCGGCG ACCCGACGCT GCTCGGCGAG CAGATGGCGC GGCTGCAGCG GGTCGCGCAC CGGGTGGTGT GGGTCAACCC GCACCGCGGC AAGGACGGCT ACGAGCCCGT GCAGCAGGGC GTGGTCGCCG CGCTGCCGCA CTGCGACGAC TTCGTCGCCG GGCACTCCCT GGCGACCTTC GCGGAGCTGG TGGAGGTGGT GTCCCGTGCG TGA
|
Protein sequence | MTDRLPDEIL LAFARALRAA GVPVTHDRAQ GFLAAAAELG VADQRATYLA GRATLCAGPD DLERYDQVFH AFFNARDGLP RSRPAHPGAP FAGLPLEEST GTGEGQDEEV LRALASETEV LRQRDVASLS AAEKHRLAGM FATLRPRPPT RRTARHQAWH RGEVDASRTL RASLRRMGEP ADIAWRRRRR RPRRVVLLVD VSGSMSGYAD ALLRLAHRFT QVGRESGGVV ETFTVGTRLT HLTRAMRVRD PERALVAAGE TVPDWSGGTR LGETLRIFLD RWGQRGLARG AVVVVFSDGW ERGDPTLLGE QMARLQRVAH RVVWVNPHRG KDGYEPVQQG VVAALPHCDD FVAGHSLATF AELVEVVSRA
|
| |