Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2479 |
Symbol | |
ID | 4597104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2645586 |
End bp | 2646584 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639777084 |
Product | HAD family hydrolase |
Protein accession | YP_923670 |
Protein GI | 119716705 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.19473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCGGCA CGTCGGAGCA GGTCCTGTGT CGGGCCTACG ACCTGGCCAT GTTGGACCTC GACGGGGTCG TCTACGTCGG CGGGGACGCG GTGCCGCGCG CGCCGGAGCA CCTCGCTTCC GCCCGCGCTG CCGGCATGCG GCTGGCGTTC ATCACGAACA ACGCGGCCCG GTCGCCCGGC ACCGTCGCCG CGCACCTGTC CGAGCTCGGT GTCCCGGCCG AGGACGCGGA CGTGGTCACG TCGGCCCAGG CGGCCGCGCA CCTGGTGCTC GAGCGGGTCG GCGCCGGCGC CCGCGTGGTG TGCCTCGGAG CCGAGGGCCT GCGGGAGGCG GTGGACGCCG TCGGCCTGGT GCCCGTGGGC CCCGATGACG AGGCAGCGGC CGTCGTGACC GGCTACGGTC CCGACGTCCG CTGGCGGGAC ATCATGCGGG TCGCGGTGCG CATCCGCGAC GGCCTGCCCT GGGTCGCGAG CAACACCGAC CTGACGTTCC CGGCCGCATT CGGGGTCGCG CCTGGTCACG GGGTGCAGGT GGACATGCTG CGCCGGTTCT CCGCTGTCGA TCCCGCCGTC GCCGGCAAGC CTGCCCGCCC GCTCCTCGAC GAGACCGTCC GGCGGGTCGG TGGCAGGCGC CCGCTGATGG TCGGCGATCG GTTGGACACC GACATCGAAG GCGCCCGCGT CGCCGGTCTC GATTCGCTGC TGGTGCTGAC CGGCGTCACG GGCCTCGAGG AGCTGGTCGC CGCCCCCGAA CCGCTGCGGC CCACCTACCT CGCCCCCGAC CTGAGCGGGC TGTTGGAGCG GCAGGCCGCG CCGGTGCAGG CTGACGGCGA GTTCACGCTC GGCGGGTGGC GCGCTCGGGC CGACTCGGAC GGCCTACGGG TCACGGGTGG AGGCGAACCG GCGGACTGGT GGCGCGTCGT CGCCAGTGCG GCCTGGGCGC ACCTGGACCG GACCGGGGAT CCGGTCCTGG TGTCGGGTCT CGAGGCCCCC GCGCGGTAG
|
Protein sequence | MLGTSEQVLC RAYDLAMLDL DGVVYVGGDA VPRAPEHLAS ARAAGMRLAF ITNNAARSPG TVAAHLSELG VPAEDADVVT SAQAAAHLVL ERVGAGARVV CLGAEGLREA VDAVGLVPVG PDDEAAAVVT GYGPDVRWRD IMRVAVRIRD GLPWVASNTD LTFPAAFGVA PGHGVQVDML RRFSAVDPAV AGKPARPLLD ETVRRVGGRR PLMVGDRLDT DIEGARVAGL DSLLVLTGVT GLEELVAAPE PLRPTYLAPD LSGLLERQAA PVQADGEFTL GGWRARADSD GLRVTGGGEP ADWWRVVASA AWAHLDRTGD PVLVSGLEAP AR
|
| |