Gene Noca_1402 details


Gene Information

Locus tag: Noca_1402
Symbol:
ID: 4595875
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1480977
End bp: 1482956
Gene Length: 1980 bp
Protein Length: 659 aa
Translation table: 11
GC content: 69%
IMG OID: 639776000
Product: putative glycosyltransferase
Protein accession: YP_922603
Protein GI: 119715638
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
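
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, when has_stop is True,
    that the final codon is a stop codon that contributes no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# Values taken from the record for Noca_1402.
length_bp = gene_length(1480977, 1482956)
print(length_bp)                  # 1980
print(protein_length(length_bp))  # 659
```

Under these assumptions the computed values agree with the Gene Length (1980 bp) and Protein Length (659 aa) fields listed in the record.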


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGTCA CCCGCCTGCT CCAGCGACAG ATCCTGCCCG TCGACCGCGA CTTCGACGTG 
CTCGCGCTGT ACGTCGACCC CGAGGACGCC AAGCTCGACG CGGACAAGTA CGAGATCGGC
GGCAGCCGCG CGGCCAAGGA CCTCAACAAC GCCGCGATCC GCCAGTCCAC CGCGACCGGC
CACACGATCC ACCCCGACCA GATCGAGTCC CGCACCGCGC TGCGGGTCAA GTCGGGCGAC
CGGCTCTCGT TCGGCACCTA CTTCAACGCC TTCCCCGCCA GCTACTGGCG CCGCTGGACG
ATCGTCAAGG ACGTCACCTT GACGATCACC GTCGCGGGCC GGGGCGCCAC CGTCCTGGTC
TACAAGTCGA TGGCCAAGGG CCACTCGCAG CGCGTCGCGT CCGCCGACAC CGGCGCCGAG
GGCCGCAGCA CCTTCAGCTT CGACCTCAGC CTCAAGCCGT TCGTGGACGG CGGCTGGTAC
TGGTACGACA TCATCGCCGG CGACGACGAC GTGGTGGTCG AGAGCGCCGA GTGGAGCGCC
GAGGTGCCCG AGGACCGGGC CGAGCACGGC ACCGTCGACA TCGCGATCAC CACGATGAAC
CGTCCCGACT TCTGCGCGAA GCTGCTCGGC CAGCTCGGCG ACGACCAGGA CGTGCGGCCC
TACCTCGACA CCGTCTTCGT CATGGAGCAG GGCACCGACA AGGTGGTCGA CTCGCCGGAC
TTCGCGAAGG CCCAGGGCGC GCTCGGCGAC CTGCTGCGCG TGATCGAGCA GGGCAACCTC
GGCGGCTCCG GCGGCTACGC CCGCGGCCAG CTCGAGTCGG TCCGCAAGGG CACCGCGACG
TACACGATGA TGATGGACGA CGACATCGTC TGCGAGCCCG AGGGCGTGAT CCGGGCGATC
ACCTTCGCCG ACCTGGCCCG CCGCCCCACC ATCGTCGGCG GCCACATGTT CAACATCTAC
TCCCGCTCCC GGCTGCACAG CTTCGGCGAG ATCGTCCAGC CGTGGCGGTT CTGGTGGCAG
TCGCCGCTGG ACACCTACAG CGACTGGGAC CTCGCCGGGC GCAACCTGCG CTCGAGCCGG
TGGCTGCACA AGCGCATCGA CGTGGACTTC AACGGCTGGT TCATGTGCCT GGTACCGCGG
CAGGTGCTCG AGGAGATCGG GCTCTCGCTG CCGCTGTTCA TCAAGTGGGA TGACTCCGAG
TTCGGGCTGC GCGCCAAGGA GGCCGGCTAC CCCACGGTGA CCTTCCCCGG CGCGGCGGTC
TGGCACGTGC CGTGGACCGA CAAGAACGAC GGGTTGGACT GGCAGGCCTA CTTCCACCAG
CGCAACCGGT TCGTCGCCGC GCTGCTGCAC TCGCCGTACC CCAAGGGGGG TCGGATGGTG
CGGGAGAGCC GCAACCACCA GATCTCCCAC TTGGTCTCGA TGCAGTACTC CACGGTCCAG
ATCCGCCACC AGGCGCTGCT CGACGTGCTG GCCGGGCCGG ACAAGCTGCA CGAGATGCTC
CCGACCCGCC TCGCCGAGAT CAACGCGATG CGCAAGCAGT ACACCGACGC CCAGCTCGAG
GCGGACCCGG ACGCGTTCCC GCCGATCCGG CGCAAGAAGC CGCCGCGCAA GGGGCGCGAC
GGCAGCGAGA TCCCCGGGCG CCTCTCCCAG CTGGTCAGCG CCGGCCTCCA GCCGCTGCGC
CAGCTCAAGC CCCCGCGCGA GCTCGCCCAG GAGCACCCCG AGGCCGAGAT CCGCGCGATG
GACGCCAAGT GGTACCGGTT GGCGTCGTAC GACTCCGCGA TCGTCTCGAT GAACGACGGC
GCCTCCGCGG CGTTCTACCG GCGCGACCCC CAGCTGTTCC GCGAGCTGAT GGTCAAGACC
ATCGAGATCC ACGAGCGGCT CAAGCGCGAG TGGCCGCGGC TGGCCGAGGA GTACCGCGCC
AAGCTCGGGG AGGTCACCTC GCCGGAGGCA TGGGAGGAGA CCTTCCGGCC GTGGACGTGA
 
Protein sequence
MTVTRLLQRQ ILPVDRDFDV LALYVDPEDA KLDADKYEIG GSRAAKDLNN AAIRQSTATG 
HTIHPDQIES RTALRVKSGD RLSFGTYFNA FPASYWRRWT IVKDVTLTIT VAGRGATVLV
YKSMAKGHSQ RVASADTGAE GRSTFSFDLS LKPFVDGGWY WYDIIAGDDD VVVESAEWSA
EVPEDRAEHG TVDIAITTMN RPDFCAKLLG QLGDDQDVRP YLDTVFVMEQ GTDKVVDSPD
FAKAQGALGD LLRVIEQGNL GGSGGYARGQ LESVRKGTAT YTMMMDDDIV CEPEGVIRAI
TFADLARRPT IVGGHMFNIY SRSRLHSFGE IVQPWRFWWQ SPLDTYSDWD LAGRNLRSSR
WLHKRIDVDF NGWFMCLVPR QVLEEIGLSL PLFIKWDDSE FGLRAKEAGY PTVTFPGAAV
WHVPWTDKND GLDWQAYFHQ RNRFVAALLH SPYPKGGRMV RESRNHQISH LVSMQYSTVQ
IRHQALLDVL AGPDKLHEML PTRLAEINAM RKQYTDAQLE ADPDAFPPIR RKKPPRKGRD
GSEIPGRLSQ LVSAGLQPLR QLKPPRELAQ EHPEAEIRAM DAKWYRLASY DSAIVSMNDG
ASAAFYRRDP QLFRELMVKT IEIHERLKRE WPRLAEEYRA KLGEVTSPEA WEETFRPWT
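
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an assumption-laden example rather than the tool used to produce the record: it builds the codon table from the conventional TCAG ordering, whose amino-acid assignments are the same for translation table 11 and the standard code (table 11 differs only in which codons are permitted as starts). The names `CODON_TABLE` and `translate` are hypothetical and exist only for this example.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks stop codons.
# Amino-acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGACAGTCA CCCGCCTGCT CCAGCGACAG"))  # MTVTRLLQRQ
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CCGTGGACGTGA"))  # PWT
```

The two printed fragments match the first ten and last three residues of the protein sequence above. Passing the full gene sequence to `translate` would be one way to check the whole record against the listed 659 aa protein.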