Gene Noca_3497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3497 
Symbol 
ID4595596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3705691 
End bp3707181 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content67% 
IMG OID639778105 
Productglycosyl transferase family protein 
Protein accessionYP_924684 
Protein GI119717719 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGCGA ACAGCACGGG TTCGGAGGCG GTGCCGCCGC TCTCGCAGGC GGGGGCCGAC 
AGCCTGCCGT ACGTGCGCAC CGCCGACCCC GGCCTCGCCG CCTACGAGGG CCGGTTCCTC
GGCGAGGTCG AGGAGCTGCC GACGTACCGC CCGACCGTCG GGTGCATCAT CCCGGCGTAC
AACGAAGCCG AGACCATCGC CGGCGTCCTG GACTCCCTGC TCCAGCAGAC CCGCCTGCCC
GACGCGATCC ACGTCATCAT CAACAACACC AGCGACGACT CCGTCGAGAT CGCCAGCCAC
TACGCCGGCC CGCACACCCG GATGACCCCG TCCGGGGAAC AGAGCACGGT CATCTACGTG
CACGACATCG GCAAGAACCC CGACAAGAAG GTCGGTGCCC TCAACTACGG CTACTCGCTC
GTGGAGACGA TGGACTACCT CCTCGGCGTG GACGGCGACA CCACCCCGGA GCCGGACGCC
ATCGAGCACC TGGTCGACGA GATCGCCAGC GACGACCGGA TCGGCGGCAT CTCCGCGATC
TACTCGATCG ACGACAGCGC CCTGGACAGC TGGATGGCGA AGTTCCTGAT CGCGGGGCAG
CGGGCGCAGT TCTCGGCGTT CAACATGCAG AACCTGCTCA AGGGCCGCAA CATGGCGGTC
CTCGGCGGCC AGTTCTCGAT CTTCTCGACG CAAGCGTTGC GTGACGTGCT GCGCGACAGC
CACCAGCGCA CCCCGTGGGT CAACGACAGC GAGGTCGAGG ACTCGCTGCT CTCGCTGCAG
ATCAAGAGCG CCGGCTACCT CACCAAGATC AGCGCCCGGG CCCGCGCGCA CGTCGGCGGC
ATGGACACGC TGCGCTCGCT GGACGCCCAG CAGGTGAAGT GGAACTTCGG CGCGATCGAC
CTGATGTGGC CCGGCCAGCG CGGCGACACC AAGGGGCAGC CCTTCCACCC CAACCTGCGG
CTGCGGTGGT TCGAGCACAT GTCGATGGTC ATCAACATCA CCACCCGCAC GCTGTTCGTC
CTGCTGCTCG CCGGCTCGCT CAACATCCAC GCGTTCGTGT TCAGCCCGTG GTGGCTGATC
CCGCCGGCGG CCGCCGTCGG GCTGAACTTC CGCGTGGCCC GGTCGATGGC CTTCGCCAAC
CGGCGCGACT ACCTCTTCGC GGTGCTGATC GTCCCGGCGG AGGCCTACAT GGTGATCCGG
ATGGGGCACT TCATCCGGGC CTGGCTGAAG TTCTTCAGTC GGCAGCAGAC CGACAACTGG
GCCGCCCAGG CCAAGGCCGA GCGCGGCAAG GGCATCGCCT GGACCTACCC CTTCGTCGCG
TTCGGCGTCA TGTTCGCGGT GTTCGCGGTG GTCTGGACGC AGTTCCTCTC GATCCCGCTG
CGCTCCGACA TCCTGGCGGT CTGCTGGCCG ATCCTCGGCG TGATCACCGT CCTGCAGACC
GCCTGGATGA TCATCAAGGC CATGAAGCGC TACCGCGGCT TCAAGGCCTG A
 
Protein sequence
MNANSTGSEA VPPLSQAGAD SLPYVRTADP GLAAYEGRFL GEVEELPTYR PTVGCIIPAY 
NEAETIAGVL DSLLQQTRLP DAIHVIINNT SDDSVEIASH YAGPHTRMTP SGEQSTVIYV
HDIGKNPDKK VGALNYGYSL VETMDYLLGV DGDTTPEPDA IEHLVDEIAS DDRIGGISAI
YSIDDSALDS WMAKFLIAGQ RAQFSAFNMQ NLLKGRNMAV LGGQFSIFST QALRDVLRDS
HQRTPWVNDS EVEDSLLSLQ IKSAGYLTKI SARARAHVGG MDTLRSLDAQ QVKWNFGAID
LMWPGQRGDT KGQPFHPNLR LRWFEHMSMV INITTRTLFV LLLAGSLNIH AFVFSPWWLI
PPAAAVGLNF RVARSMAFAN RRDYLFAVLI VPAEAYMVIR MGHFIRAWLK FFSRQQTDNW
AAQAKAERGK GIAWTYPFVA FGVMFAVFAV VWTQFLSIPL RSDILAVCWP ILGVITVLQT
AWMIIKAMKR YRGFKA