Gene Noca_3710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3710 
Symbol 
ID4597627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3939106 
End bp3940368 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content72% 
IMG OID639778318 
Productexopolysaccharide biosynthesis protein-like 
Protein accessionYP_924897 
Protein GI119717932 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0492868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGTACCC GTCTCCTCTC GGTCAGTCTT GCCGCCTGCC TGCTCGCGGC CACGGGGCCC 
GCTCTCGCGG GCGACCCGGG CAGCGACGCG TCGGCGGCCG GCCAAGCGGT CGCGCGCGGT
GGACACGGGA ACCGCCCCGA CGTCCCCCGC ACGCTGCACC CGCAACACAC CTCGGGGGGC
GAGGAGGGTG AGGTCGCGGC GGCGCTGCCG TCGATGCGCC GCACGGCCAG CCAGAACTGG
CGGACCAGCT GGCAGGTCGC CCCGGGCGTG AAGTTCACCC GCTGGAGCCA GACCGACGCG
CGCGGACCGA TCGTCGCGCA CCTGCTCACG ATCGACCCGA AGACACCGGG GCTGCGCATC
GACTACGCCA GCATGGGCGC CGTACGTCGC GTCGCGCCGG TCCGCGACAT CCTCGCCGTC
GACAACGCCG TCGCCGGCGT CAACGGCGAC TTCTACGACA TCGGCCACAC CGGCGCCCCC
CTCGGCCTGG GCAAGGACCG GCAGCGCGGG CTGCTGCACG CCCGGGAGGA CGGCTGGAAC
AAGGCGTTCT TCATCAATCG CCACGGTCGG GCCGGCATCG GCGACCTGCC GATGACGGCG
CGCGTGCTCT ACCACCCGAA GCTGAAGGTC ACGAACCTGA ACTCGCCCTT CGTGATGCCG
GGCGGCATCG GCATCTACAC CCCGCGATGG GGCCGCACCG CCGGGTACGG CGTCACCCAG
GGCCAGACCG AGCGGGTGCG CGCGGTGACC GTCGTCAACG GCCGGGTGCG GACCAACCGC
GCGAAGCTCA GCCACGACCA GCCGATCAAG GGCCTGCTGT TCATCGGCCG CGGCGAGGGC
GCCAAGGTGC TGCGCAAGCT GCCCAAGCAC ACCCGGATCA AGGTCCGATG GTCGCTCCAG
GGACGCCCGC AGATGGCCAT CAGCGGGAAC AACTTCCTGG TCCACGACGG CATCATCCGC
GCGATCGACG ACCGCGAGAT GCACCCGCGC ACCGCGGTCG GAGTCGACTC CGACACCGGC
GAGGTGCTGC TGCTGGTCGT CGACGGCCGC CAGGCCGACA GCCGCGGCTA CACGATGGTG
GAGCTCGCGA ACCTGATGGT CGACCTGGGT GCCGACGAGG CCGTGAACCT CGACGGTGGC
GGCTCGTCGA CGATGGTCGG CAAGAACCGC AGGGGGAAGG TGGCGGTCCT CAACGACCCC
TCCGACGGCT TCCAGCGCTG GGTCGCGAAC GCGATCGAGG TGACCTACTC CCCGCCGTCC
TGA
 
Protein sequence
MRTRLLSVSL AACLLAATGP ALAGDPGSDA SAAGQAVARG GHGNRPDVPR TLHPQHTSGG 
EEGEVAAALP SMRRTASQNW RTSWQVAPGV KFTRWSQTDA RGPIVAHLLT IDPKTPGLRI
DYASMGAVRR VAPVRDILAV DNAVAGVNGD FYDIGHTGAP LGLGKDRQRG LLHAREDGWN
KAFFINRHGR AGIGDLPMTA RVLYHPKLKV TNLNSPFVMP GGIGIYTPRW GRTAGYGVTQ
GQTERVRAVT VVNGRVRTNR AKLSHDQPIK GLLFIGRGEG AKVLRKLPKH TRIKVRWSLQ
GRPQMAISGN NFLVHDGIIR AIDDREMHPR TAVGVDSDTG EVLLLVVDGR QADSRGYTMV
ELANLMVDLG ADEAVNLDGG GSSTMVGKNR RGKVAVLNDP SDGFQRWVAN AIEVTYSPPS