Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3710 |
Symbol | |
ID | 4597627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3939106 |
End bp | 3940368 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639778318 |
Product | exopolysaccharide biosynthesis protein-like |
Protein accession | YP_924897 |
Protein GI | 119717932 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0492868 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGTACCC GTCTCCTCTC GGTCAGTCTT GCCGCCTGCC TGCTCGCGGC CACGGGGCCC GCTCTCGCGG GCGACCCGGG CAGCGACGCG TCGGCGGCCG GCCAAGCGGT CGCGCGCGGT GGACACGGGA ACCGCCCCGA CGTCCCCCGC ACGCTGCACC CGCAACACAC CTCGGGGGGC GAGGAGGGTG AGGTCGCGGC GGCGCTGCCG TCGATGCGCC GCACGGCCAG CCAGAACTGG CGGACCAGCT GGCAGGTCGC CCCGGGCGTG AAGTTCACCC GCTGGAGCCA GACCGACGCG CGCGGACCGA TCGTCGCGCA CCTGCTCACG ATCGACCCGA AGACACCGGG GCTGCGCATC GACTACGCCA GCATGGGCGC CGTACGTCGC GTCGCGCCGG TCCGCGACAT CCTCGCCGTC GACAACGCCG TCGCCGGCGT CAACGGCGAC TTCTACGACA TCGGCCACAC CGGCGCCCCC CTCGGCCTGG GCAAGGACCG GCAGCGCGGG CTGCTGCACG CCCGGGAGGA CGGCTGGAAC AAGGCGTTCT TCATCAATCG CCACGGTCGG GCCGGCATCG GCGACCTGCC GATGACGGCG CGCGTGCTCT ACCACCCGAA GCTGAAGGTC ACGAACCTGA ACTCGCCCTT CGTGATGCCG GGCGGCATCG GCATCTACAC CCCGCGATGG GGCCGCACCG CCGGGTACGG CGTCACCCAG GGCCAGACCG AGCGGGTGCG CGCGGTGACC GTCGTCAACG GCCGGGTGCG GACCAACCGC GCGAAGCTCA GCCACGACCA GCCGATCAAG GGCCTGCTGT TCATCGGCCG CGGCGAGGGC GCCAAGGTGC TGCGCAAGCT GCCCAAGCAC ACCCGGATCA AGGTCCGATG GTCGCTCCAG GGACGCCCGC AGATGGCCAT CAGCGGGAAC AACTTCCTGG TCCACGACGG CATCATCCGC GCGATCGACG ACCGCGAGAT GCACCCGCGC ACCGCGGTCG GAGTCGACTC CGACACCGGC GAGGTGCTGC TGCTGGTCGT CGACGGCCGC CAGGCCGACA GCCGCGGCTA CACGATGGTG GAGCTCGCGA ACCTGATGGT CGACCTGGGT GCCGACGAGG CCGTGAACCT CGACGGTGGC GGCTCGTCGA CGATGGTCGG CAAGAACCGC AGGGGGAAGG TGGCGGTCCT CAACGACCCC TCCGACGGCT TCCAGCGCTG GGTCGCGAAC GCGATCGAGG TGACCTACTC CCCGCCGTCC TGA
|
Protein sequence | MRTRLLSVSL AACLLAATGP ALAGDPGSDA SAAGQAVARG GHGNRPDVPR TLHPQHTSGG EEGEVAAALP SMRRTASQNW RTSWQVAPGV KFTRWSQTDA RGPIVAHLLT IDPKTPGLRI DYASMGAVRR VAPVRDILAV DNAVAGVNGD FYDIGHTGAP LGLGKDRQRG LLHAREDGWN KAFFINRHGR AGIGDLPMTA RVLYHPKLKV TNLNSPFVMP GGIGIYTPRW GRTAGYGVTQ GQTERVRAVT VVNGRVRTNR AKLSHDQPIK GLLFIGRGEG AKVLRKLPKH TRIKVRWSLQ GRPQMAISGN NFLVHDGIIR AIDDREMHPR TAVGVDSDTG EVLLLVVDGR QADSRGYTMV ELANLMVDLG ADEAVNLDGG GSSTMVGKNR RGKVAVLNDP SDGFQRWVAN AIEVTYSPPS
|
| |