Gene Noca_4583 details


Gene Information

Locus tag: Noca_4583
Symbol:
ID: 4598681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4852004
End bp: 4853647
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 76%
IMG OID: 639779192
Product: glycosyl transferase family protein
Protein accession: YP_925765
Protein GI: 119718800
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
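
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the gene record.
start_bp = 4852004
end_bp = 4853647
gene_length_bp = 1644
protein_length_aa = 547


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, not counting the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks agree with the record: 4853647 - 4852004 + 1 = 1644 bp, and 1644 / 3 - 1 = 547 aa.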


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.474593
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGACC AGCCCCGGGT GCGCCGCAAC GACTGGGGCA CCCTCGACGC GCCCGCCCTC 
GGCCGGTGGG AGCCGACCCG GTCGGTGTCC GTGGTGATCC CGGCGTACGG CGCGCAGCGG
CTGCTCCCCT ACGTGCTCGC CGGCCTCGCC GCCCAGACCT ACCCGGCGCA CCTGCTCGAG
GTGGTGGTCG CCGACGACGA CCCGGCGCGC CCCCTCGAGC TGTCCGAGCT GACCGGGCCG
CGGCCCGAGC GCACCCGGAT CGTGCGCGTC GAGACCGGCT GGGGCCGGGC CAACGCCTGC
CACACCGGCG CCCTCGCCGC CGACGGCGAG GTGCTGCACT GGCTCGACGC CGACATGCTG
GTCGAGCCCG AGGAGGTCGA GGCCCAGCTG CGCTGGCACC ACCTGATCGA CCACGCGGTC
GTGCTCGGGC ACAAGTGGTT CGTGGACCCG GATTCCTTGC TGGCAAGGGA TCCCGACGCG
AGCGGGCGGA TGCACGAGGT GTTCGCCACT GCCGAGAAGG AGCGGCACTG GGTCGAGGAC
GTGTGGGACC GGACCGACGA CCTGCGCACC GCCGGCCCGC GCGCCCTGCG CACCCACGTG
GGCGCGACGG CGTCCCTGCG CCGGACGCTG TACGACGAGG CCGGCGGCAT GGACACCGGC
CTGCGGCTCG GCGAGGACAT CGACCTCGGC TACCGGCTGG CCGAGTGCGG TGCGGTCTTC
GTCGCCGACC GGGAGGCGCG CAGCTGGCAC CTGGGCCGTT CGCACCAGCA GAGCCGCCAG
GAGGAGGTCA ACGACTACAA CACCCACTTC CTCGCCGACC GGGTCCCCGA CCTGCAGCCG
CAGCGCCGCC GCGGCCGGCT CTACTCCGTG CCCTACCTCG AGGTGGTGCT GGACACCCGC
GGCCTGGACC ACACGGCGGT GATCGCCACC GTCGACTCGG TGCTCGGCTC CTCGCTCACC
GACCTCTCGG TGACGCTGCT CGGTGACTGG TCGCGGATCA CCGAGGAGCG CACCCACCCG
CTCGACGACG AGCAGCGGGC GCCGCGGCTG GTGCGGGCGG CGTACGCCGG CGAGCCCCGG
GTGCTGCTCG CCGAGTCGTC GGGCCGGCGC CGCGGCCAGT TCCGGCTGGT GCTCGCCAAC
GCCGACTGGG CGCCGACGCC CGAGACGCTC GGCACGCTGG TGCTGCACCT GGAGCGCACC
CACCACGGCC TGCGCCAGGT GCTGATGGCC GACGGCTCGG TGGCGCGGGT CGAGCGGACC
GCGGCGTACG CCCGGGCCGC CAAGGTGGCC GGCCCCGAGG GGAATCCCGC CGACCCCGAG
CGCCTGGACG ACCTGGTCGA CGAGCTGTTC GGGTCGTGGT GGGTCGAGGG CGGGGACGCC
GGGTTCGAGC CCAGCGTGAC GATCAAGCGG CCGCGGCTGC GCGGCACCGC CGGGCCGGCC
CAGGACCCGG CCGAGTCCTG GCGCGAGCTC GGCGGGACGG GCGGGGGATC CCCGCAGCAG
AAGGGCAGGA GGGCCGGGGC GCAGCAGGGC AAGCAGGCGG CGCAGCAGGC CGGGCAGCCG
GCCCAGCCGG AGCCCCGGCG GCAGGCCCCG GCGCGCCCCG GGCGGTCGAG GATCGGTTCG
CTCCTGCGCC GCGGGCGACG CTAG
 
Protein sequence
MSDQPRVRRN DWGTLDAPAL GRWEPTRSVS VVIPAYGAQR LLPYVLAGLA AQTYPAHLLE 
VVVADDDPAR PLELSELTGP RPERTRIVRV ETGWGRANAC HTGALAADGE VLHWLDADML
VEPEEVEAQL RWHHLIDHAV VLGHKWFVDP DSLLARDPDA SGRMHEVFAT AEKERHWVED
VWDRTDDLRT AGPRALRTHV GATASLRRTL YDEAGGMDTG LRLGEDIDLG YRLAECGAVF
VADREARSWH LGRSHQQSRQ EEVNDYNTHF LADRVPDLQP QRRRGRLYSV PYLEVVLDTR
GLDHTAVIAT VDSVLGSSLT DLSVTLLGDW SRITEERTHP LDDEQRAPRL VRAAYAGEPR
VLLAESSGRR RGQFRLVLAN ADWAPTPETL GTLVLHLERT HHGLRQVLMA DGSVARVERT
AAYARAAKVA GPEGNPADPE RLDDLVDELF GSWWVEGGDA GFEPSVTIKR PRLRGTAGPA
QDPAESWREL GGTGGGSPQQ KGRRAGAQQG KQAAQQAGQP AQPEPRRQAP ARPGRSRIGS
LLRRGRR
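
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the start position. The sketch below illustrates this on the first and last blocks of the gene sequence. It is a hand-written illustration and not the database's own pipeline: the name `translate_cds` is hypothetical, and only ATG, GTG and TTG are treated as start codons here, which is a simplification of table 11.

```python
# Codon table in NCBI ordering (bases T, C, A, G); the amino-acid
# assignments of table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Simplifying assumption: only these three initiation codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence.
print(translate_cds("GTGAGCGACC AGCCCCGGGT GCGCCGCAAC"))  # MSDQPRVRRN
print(translate_cds("CTCCTGCGCC GCGGGCGACG CTAG"))        # LLRRGRR
```

Both outputs match the corresponding ends of the protein sequence listed above. The final line of the gene sequence starts at position 1621, which is in frame.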