Gene Noca_0723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0723 
Symbol 
ID4599827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp769096 
End bp770184 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content71% 
IMG OID639775324 
Productglycoside hydrolase family protein 
Protein accessionYP_921935 
Protein GI119714970 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGCCG GCGTCCCCGC GAACACCGGG GCCGCACATC CCGTGGTCGA CGGCAACCGG 
ATCATCGACG CCCGCAGCGG TCGCGACTTC GTGCCGCGTG GGGTCAACTG GAGCAGCTTC
GAGTACGCCT GCGCCCAGGG CTGGGGCATG TCGGCGCTCG ACAACCTGGT CGCCCGGGAC
CCGGCCACGA CGGAGGCGAA GGAGATCGCC GGCTGGGGTG CGAACACGGT CCGGCTCCCC
CTCAACCAGG ACTGTTGGCT CGGCACCCGC GGAGCACCGG TCAGCGACCA GTACGAGGAA
CGCTCGGTGG CCGGCTACCG CAGCGACGTG CACGCGTTCG TGACCGCGCT CAACCGGGCC
GGCATCGTGG TCGTGCTGGA CCTGCACAGC CGCAAGCGGA TCGGGCAGCC GGAGTTCGGC
AACCTGGCGA TGCCGGACTC CGAGTCGATC GCGTTCTGGA CCTCGGTCGC CACCGAGTAC
GCCGACAACC CGTCGGTGCT GTTCGACGCC TTCAACGAGC CCTACTCCCG GTACAGCTCC
AGCGGTGCCC GGCTGTTGTT CGGGCTCACC TGGCGGTGCT GGCGGGACGG CGGTTGCCAG
GCGCCGGTCG AGGATGACCA GACCGCGACC CTCGGACAGG TCACCTATCC CGTCCAGGGC
ATGGCCGCCG TGGTCAACGC CATCCGGGAC GCCGGTGCCG AGCAGCCGAT CCTGCTCGGC
GGCCTGGACT ACGCCAACGA CATCAGCCAC TGGCTGGAGT TCGCGCCGGA CGACGACCAG
CTGGTGGCGG CGTTCCACTC CTACGACTTC AAGGCGTGCG GCGACCCGGA CTGCTGGAAC
GACGTCATCG CCCCGGTCGC CGAGACCGTG CCCGTGCTGA CCTCCGAGCT CGGGGCCCGC
CATCCCGAGA ACGGGTACGT CGGGCGCTAC CTCGACTGGG CCGACGAGCA CAACCTGGGT
GTGCTGTTCT GGGTCTGGGC GGACCATCCC GGCGACCCGA TGGCACTGGT CACCGACGAG
CGCGGTCGGC CGACGGCGTA CGGCCGGGCT GCGCGCACCT GGCTGACCGG GCACCTGCCG
GCACGGTGA
 
Protein sequence
MAAGVPANTG AAHPVVDGNR IIDARSGRDF VPRGVNWSSF EYACAQGWGM SALDNLVARD 
PATTEAKEIA GWGANTVRLP LNQDCWLGTR GAPVSDQYEE RSVAGYRSDV HAFVTALNRA
GIVVVLDLHS RKRIGQPEFG NLAMPDSESI AFWTSVATEY ADNPSVLFDA FNEPYSRYSS
SGARLLFGLT WRCWRDGGCQ APVEDDQTAT LGQVTYPVQG MAAVVNAIRD AGAEQPILLG
GLDYANDISH WLEFAPDDDQ LVAAFHSYDF KACGDPDCWN DVIAPVAETV PVLTSELGAR
HPENGYVGRY LDWADEHNLG VLFWVWADHP GDPMALVTDE RGRPTAYGRA ARTWLTGHLP
AR