Gene Information |
Locus tag | Noca_0904 |
Symbol | |
ID | 4599586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 944622 |
End bp | 947294 |
Gene Length | 2673 bp |
Protein Length | 890 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639775505 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_922114 |
Protein GI | 119715149 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.145803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGATCG CCTACGTCGT CTTCAACCTC GACGGGATGG GCGGCACCTC CCGCTCCGCC ATCACCCAGG CGAACGCCCT GGCCGGCGAC CACGACGTGC GGCTGGTGAG CATCACCCGC AGCGCGGACC GGCCGCACTA CGCCATCGAC CCGCGGATCA CCGTCGACTA CCTCGCGGAC GTGCGCGAGG AGGTGCTGGC CGAGGACGAC GCGGCGCGCG CGTTGAGCGA GCGCGAGTCG GTGCTGGTGC CGCCGCGCTG GGACAGCCAG TTCTCCGCGC TCACCGACGT CGCGATGAGC GCGGCCCTGC CCGGCCTGGA CGCGGATGTG GTCGTCACGG TGACACCGGC GCTGCTGGCC GCGGTGACCC AGCTGGTGCC GGACCGCGCG GTCCTCGTCC ACCAGGAGCA CCGCTCCTCC TCCGACCGCA CCTCCGGCCT GGAGCCGCTG CTCGCGTTCG CGCCCCGCGC GGACGTGGTC GCGATGCTCA CCGAGTCCAC GGCCGACTGG CTTCGCGACC AGCTCGGCGA CCTCACGCCC GAGGTCGTCG TCATGCCCAA CCCGCTGCCG CTCGGCTTCA CCCCGCGCTC CCGGCTCGAC GGGCGCCTGA TCGTCTCCGC CGGCCGGCTG GCCGTCGAGA AGCAGTTCAC CAAGCTGGCG CTGGCCTTCG GGGAGATCGC CGACCAGCTG CCCGGCTGGC GGCTGCGGAT CTTCGGCGAC GGCGCGCAGC GCAACGAGCT GCGCCGCCAG GCCCGCAAGC TCGGGCTCTA CGACCGGCTG GAGCTGCCCG GCAGCACCAC GGACATGGCC GGTGAGTGGG CCAAGGCGAG CATCGCCGCC CTCACCTCGA GGGCCGAGGG CTTCCCGCTG GTGCTCCAGG AGGCGATGGC CGCCGGCGTC CCGGTCGCCA GCTTCGACTG CCCGTCCGGG CCCCGCGAGA TCGTCGAGCA CGACGTCAAC GGCCTGCTCG TCGCGCCCGA CTCGGTCGCC GGCATGGCGA GCGCGCTGCT GCGCCTGGCG ACCGACGACG AGCTGCGCGC GCGGCTCGGC GCCGGCGCCC TGCACTCCTC ACGGCAGTAC GACGCCCGTG CGCTGGCCGA GCAGTGGGTC GGGATCTTCG CCGACGCCCG GGCCCGGCGC GGCACGCTCG GTCGGGTGGC GGCGCGGGTC GGGGCGCCCC GGCACCGGTC GGTCGGTCCG GGGCCGACGG CCGACATCAC CGGCATCACC CCGGCGCAGG CGCGCCACGC GGCGCTCGCC GCGGCGGTCG CCGCCGCGCG CTCGGCCACC GAGGAGTGGC TGGTGATCCC CGAGCACGAG TCGGCGGCGC CGGCGGTCGT GGTGCCGATG ACCGCCCGCG ACGCGGTCCT CGCCGCGCTC GCCGAGGCCG AGCCGCCGGC GTACCTCTGC CTGCGCGAGC CCGCGGCCCA GGGCTGGCCC GAACGGCGCG GACCGGTGGG CGCGATGGCC ACCGAGCTGC GCCGCGGCCG CACCGGGTCG GTGTTCCTCG AGCCGTGGCC CGAGGACGGC GAGCACGCCA GCGTGCTGGG CCAGGGGTGC GGGGTCGGGC TGGAGTTCTG GGAGACCAGC GTCGAGGACG AGCTGGTCGC GGCGCGGCCG AACCGCTACA CCCGCCGGAT CCCCCGCGGC ACCCCCACGG TGGACACCGA GATCGGCGGC GTCGCCGTAC GCACGCTGCC GCTGATGACC GAGCGCACCG TCAACGAGTG CGGCTTCCCG ATCGACGTCG TCTACACGTG GGTCGACGGC AACGACCCGG TGTGGAACGC CGCGCGCGAG 
GACCGGCTGG CGCGGCTCAG CGGCACCGCC CTGACCCGCG AGTCCAGCGG CCGCGCGCGG TTCGTCTCGC GCGACGAGCT GCGCTACTCG ATGCGCAGCG TGCACCTGTT CGCACCGTGG GTGCGCCGCA TCCACCTCGT CACCGCCGGC CAGGTGCCGG ACTGGCTGGA CACCTCGCAC CCGTCGATCC GGGTCGTCGA CCACGCCGAG ATCCTCCCGG CCGGGGCGCT GCCGACGTTC AACTCGCACG CGATCGAGAC CGGGCTGCAC CACGTGCCCG ACCTGACCGA GCACTTCGTC TACCTCAACG ACGACGTCTT CCTCGGCCGG CCGGTGCGGC CCGAGATCTT CTTCAGCCCG GCCGGGCTGT TCGCGGCCTT CATGTCGCCC ACGCAGGTCG GGCTGGAGGA CGTGCCCGGC GCCGCGCCGT TCCTGAAGGC GGCCTGGAAC AACCGGCACC TGCTGCAGGA GGCGTTCGGC GTCGTCACCA CCAACAACCT CGCGCACACG CCGCACCCGC ACCGCCGCTC GGTGCTCGAC GAGGTCGAGC AGCGGTTCGC CGACGCGGTC GCGGGGACCG CGGCGGCGCC GTTCCGCTCC GACACCGACG TCTCGATGCT CAGCTCGCTG GCGCAGCACT ACGGCCTGGC GACGGGCACG GCCTACCTCG GCGAGGCGGC CTTCGAGTTC GTCAACCTCA GCAACAGCGA CCTGCTCCGC CAGCTCAACC AGCTGAGGGC GCGCCAGCAG GACTTCTTCT GCCTCGGCGA CCACCACGAC TACGCGATGG TGGCCGCCCG CCTGGACCAG GAGCTCGCGG CGTTCTTCTC GTCGTACTTC CCGGTCGCGG CGCCGTGGGA GGTCAAGCCT TGA
|
Protein sequence | MRIAYVVFNL DGMGGTSRSA ITQANALAGD HDVRLVSITR SADRPHYAID PRITVDYLAD VREEVLAEDD AARALSERES VLVPPRWDSQ FSALTDVAMS AALPGLDADV VVTVTPALLA AVTQLVPDRA VLVHQEHRSS SDRTSGLEPL LAFAPRADVV AMLTESTADW LRDQLGDLTP EVVVMPNPLP LGFTPRSRLD GRLIVSAGRL AVEKQFTKLA LAFGEIADQL PGWRLRIFGD GAQRNELRRQ ARKLGLYDRL ELPGSTTDMA GEWAKASIAA LTSRAEGFPL VLQEAMAAGV PVASFDCPSG PREIVEHDVN GLLVAPDSVA GMASALLRLA TDDELRARLG AGALHSSRQY DARALAEQWV GIFADARARR GTLGRVAARV GAPRHRSVGP GPTADITGIT PAQARHAALA AAVAAARSAT EEWLVIPEHE SAAPAVVVPM TARDAVLAAL AEAEPPAYLC LREPAAQGWP ERRGPVGAMA TELRRGRTGS VFLEPWPEDG EHASVLGQGC GVGLEFWETS VEDELVAARP NRYTRRIPRG TPTVDTEIGG VAVRTLPLMT ERTVNECGFP IDVVYTWVDG NDPVWNAARE DRLARLSGTA LTRESSGRAR FVSRDELRYS MRSVHLFAPW VRRIHLVTAG QVPDWLDTSH PSIRVVDHAE ILPAGALPTF NSHAIETGLH HVPDLTEHFV YLNDDVFLGR PVRPEIFFSP AGLFAAFMSP TQVGLEDVPG AAPFLKAAWN NRHLLQEAFG VVTTNNLAHT PHPHRRSVLD EVEQRFADAV AGTAAAPFRS DTDVSMLSSL AQHYGLATGT AYLGEAAFEF VNLSNSDLLR QLNQLRARQQ DFFCLGDHHD YAMVAARLDQ ELAAFFSSYF PVAAPWEVKP
|
| |
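The lengths listed under Gene Information can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only: the helper names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical and not part of any database API. It assumes 1-based inclusive coordinates (which agrees with the listed 2673 bp) and a gene length that includes the terminal stop codon (the gene sequence ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """Percent G+C; whitespace (e.g. the 10-base grouping used above) is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(944622, 947294)
    print(length, protein_length(length))
```

Passing the full gene sequence from the Sequence section to `gc_percent` gives a value to compare with the listed GC content.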