Gene Noca_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2065 
Symbol 
ID4595819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2209172 
End bp2211424 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content77% 
IMG OID639776668 
Productglycosyl transferase, group 1 
Protein accessionYP_923261 
Protein GI119716296 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG0392] Predicted integral membrane protein
[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR00374] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCTT CATCCGGTGC GGCCTACGGT CCCGGCATGC ACGCGGTCGC CGGCCAGCCC 
ATCGAGGCCC GGTCCCTGGC GCGGACCTGG CGCCGGCTCC GCTTCGTCGG CTCCTGGGCA
CTCGCGCTCG GCCTGGTCGC GGTCGGGATC CCGCAGGCGG TCGACGTGTC CTGGCACGGC
GTGCTGCCGG TGCTCCGGTC GCTGCACTGG CCGGCCGTCC TCGGCCTGGT GGTGCTGTGG
TTCCTCGGCC TGGTCGTGCA CTCCAGCGTG CTGACCGCCG CGGCGCCCAG CCTCACCCAC
CGCCGGGCAC TGGCGCTGAA CCTGACCGGC AGCGCCGTCG CGAACGTCGT CCCGCTCGGC
GGGGCGGCCG GCGTCGAGCT GAACCGCCGG ATGATGCGTG CCTGGGGCAT CGACGGGCGC
CGGTTCGCCG GCTACACCTT CCTCACCAAC CTCGTCGACA TCGGCGCCAA GCTGGTGCTG
CCCGTGATCG CCGTCGTCGC GCTGGCGCAC GCGGGCGAGT CGGTGACCGC GTCCCTGCGA
TACACCGCGG TGCTGGCCGG CCTGGCGTTC GTCGGCCTCG CCGCCTGCGC CGCGGCGGTG
CTGGCCAGCC CCCGCTGCAC GGTCGGCGTG GGCCGGGCCG TCGAGTGGGT CGCGCGACCG
GTGCTGCGCG TGCTGCGACG CACCACCGAC CTCGACGTGG CCGGGCCGCT GCTCGACGTA
CGCCGGGAGT GTGCGCAGGT CGTCTCCCGC GGCTGGCTGC GGATGTCGCT GGGCATGGGC
GGGTACGTCG CGCTCCAGGG CCTCCTGCTC GGGCTGTGCC TGCACGTGAC CCACAGCGGC
GTCACCTGGA CCGAGGTCCT CGCCGGCTTC GCCCTGGAGC GCACGCTGAC GGTGCTGCCG
GTGACCCCCG GCGGCGTCGG CATCGCCGAC GTCGGCCTGG TCGGCATCCT GATGGCGCTC
GGCGGCGACC CGGCCGGCGC TGCGGCCGGC GCGGTGCTCT ACCGCGGCTT CGTCTTCGCC
CTCGAGATCC CGGTGGGCGG TGGCACCCTG GGCGTCTGGC TGCTCGGCCG GCGCCGGGCC
GCCCGGCGGG TGCAGCACCC GGCGCGGATC GTGGGGGCCG ATCCGCGCCG GGTCGCGCAC
GTGACCGACG TGTTCCTGCC GCGGCTGGGC GGGATCGAGA CCCACGTCGA CGACCTGGTC
CGGCACCAGC GCGCGCGGGG CGTCGACGCC GTCGTGCTCA CGCCGACCGC GTCGGCGGGC
CGCGACCCGG AATGGGTGCA CCGCCTGCCG GCCGCCGCGG CCCGCCGGTT CGCCACCGAG
TACGACGCGG TGCACGTCCA CGTGTCGATG CTCTCGCCGT ACGGCCTCGG GGTGGCGCGG
GCGGCGCTCG CCGCCGGGGT GCCCACGCTG GTGACCGTGC ACTCGATGTG GGCGGGCGTG
GGCGGCCTGC TCCGGCTCGC GGCCCTCGCG CGGCTGCGGC GCTGGCCGGT CGTCTGGTCC
GCGGTCAGCG GCGCCGCCGC CGAGACCTTC CGCGGCTCCC TGCACGGCGG CGACGTCGCC
GTGCTCCCGA ACGCGATCGA CGTCGAGCAG TGGCGCCGGT CGCCGGCGCC GCCCCGCCCG
GCGCGCCCGC AGGGGCCGGG CGAGGAGGCG CCGATCACGC TGGTGAGCGT GATGCGGCTG
ATGCCTCGCA AGCGACCGCT GCCGCTGATC CGCACGTTCG AGCAGGTGCG CGCGCTCGTC
CCGGGCAGGG ACGTCCGGCT GCTCGTCGTC GGCGACGGCC CGCTGCGCGG TCGGGTGGAG
CGCTACGTCC GGCGCCGCGG GCTGGTCGGG TGCGTGCGGG TCACCGGGCG GATCCCACGG
TCGGAGGTGC TCGGCCACCT GCTCTCCGCT TCCGTGTACG TCGCCCCCGC CCCGAAGGAG
TCGTTCGGCC TGGCCGCGCT CGAGGCGCGG TGCGCCGGCC TCCCGGTGGT GGCGAACCGC
CGCAGCGGGG TCGGCGAGTT CATCCGCGAC CGGGTGGACG GCATCCTGGT CGCCGACGAC
GCCGAGATGG TCGTGGCGCT GGCCGACCTG GTCCTCGACC CCGGGCTCCG CGAGCGGATC
GCCGAGCACA ACCGGCGGGT GGCCCCGGCC TTCGACTGGT CCGACGCCCT CGACCGGACC
GAGGCGCTGT ACCGGCTCGC CGGCGAGCGG CTGGCCGCGC CGGCCCGCAC CGCCGAGCCG
CTCGTCCCGG CGCTGCTCGA GGCCCAGGCC TAG
 
Protein sequence
MKASSGAAYG PGMHAVAGQP IEARSLARTW RRLRFVGSWA LALGLVAVGI PQAVDVSWHG 
VLPVLRSLHW PAVLGLVVLW FLGLVVHSSV LTAAAPSLTH RRALALNLTG SAVANVVPLG
GAAGVELNRR MMRAWGIDGR RFAGYTFLTN LVDIGAKLVL PVIAVVALAH AGESVTASLR
YTAVLAGLAF VGLAACAAAV LASPRCTVGV GRAVEWVARP VLRVLRRTTD LDVAGPLLDV
RRECAQVVSR GWLRMSLGMG GYVALQGLLL GLCLHVTHSG VTWTEVLAGF ALERTLTVLP
VTPGGVGIAD VGLVGILMAL GGDPAGAAAG AVLYRGFVFA LEIPVGGGTL GVWLLGRRRA
ARRVQHPARI VGADPRRVAH VTDVFLPRLG GIETHVDDLV RHQRARGVDA VVLTPTASAG
RDPEWVHRLP AAAARRFATE YDAVHVHVSM LSPYGLGVAR AALAAGVPTL VTVHSMWAGV
GGLLRLAALA RLRRWPVVWS AVSGAAAETF RGSLHGGDVA VLPNAIDVEQ WRRSPAPPRP
ARPQGPGEEA PITLVSVMRL MPRKRPLPLI RTFEQVRALV PGRDVRLLVV GDGPLRGRVE
RYVRRRGLVG CVRVTGRIPR SEVLGHLLSA SVYVAPAPKE SFGLAALEAR CAGLPVVANR
RSGVGEFIRD RVDGILVADD AEMVVALADL VLDPGLRERI AEHNRRVAPA FDWSDALDRT
EALYRLAGER LAAPARTAEP LVPALLEAQA