Gene Noca_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1052 
Symbol 
ID4599658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1108659 
End bp1110617 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content75% 
IMG OID639775650 
Producthypothetical protein 
Protein accessionYP_922257 
Protein GI119715292 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAATC GTCTGGCAAC CGCGACGAGC CCATACCTGC TCCAACACGC TCAGAATCCG 
GTGGATTGGT GGGAGTGGGG GCCGGAGGCG TTCGAGGAGG CCCGTCGGCG CGGCGTCCCG
GTCCTGCTCA GCGTCGGCTA CGCCGCGTGC CACTGGTGCC ACGTGATGGC GCACGAGTCC
TTCGAGGACG AGGCGACCGC GGCGTACCTC AACGAGCACT TCGTGAGCGT CAAGGTGGAC
CGCGAGGAGC GGCCCGACGT GGATGCGGTC TACATGCAGG CGACCACGTC GATGACCGGG
CACGGCGGCT GGCCGATGAC GGTGGTGCTG GACCACGAGG GCAGCCCGTT CTTCGCCGGC
ACCTACTTCC CGGACCGGCC GCGGCACGGG CAGCCCGCGT TCCGCCAGGT GCTGGAGGCG
CTGGCCGACG CGTGGCAGAA CCGATCCGAC GAGGTGCGCC GGGTGGCGGC GAACCTGCGC
GAGCACCTGT CGTCCACCAG CCTGGCCACG GCGGGCGCGC CGATCACGCG GGCGGTCCTC
GACGGCGCGG TGCGCACGCT CGCGCTGGAG TACGACGCGG ACGCGGCGGG GTTCGGCGGC
GCGCCGAAGT TCCCGCCGTC GATGGTGCTC GAGTTCCTCC GGCGGCACGG CGAGCGCGAG
ATGCTCGGCG CGACCCTGGA GGCGATGGCC CGCGGCGGCA TCCACGACCA GCTCGGCGGC
GGCTTCGCGC GCTACAGCGT CGACACGGAC TGGGTGGTGC CGCACTTCGA GAAGATGCTC
TACGACAACG CCCTGCTGCT GCGGGTGTAC GCCGAGTGGG ACACCCCGGT CGGGGTCTGG
GCCGCCGAGG GGATCGCCGA CTTCCTGCTC GGCGAGCTCC GCACGCCGGA GGGCGGCTTC
GCCTCGGCGC TCGACGCCGA CTCCGAGGGC GCCGAGGGCA CCTACTACGT CTGGACCCCC
GCTCAGCTGA CCGAGGTGCT GGGGCCCGAG GACGGTCCGT GGGCGGCCCG GCTGCTCGGC
GTGACCGACG CCGGCACCTT CGAGCACGGC ACCTCGACCC TCCAGCTGCG GCAGGACCCC
GACGACCTCG ACCGCTGGTT CGACTGCCAG CGCCGGCTGC GCGAGGCGCG GTCGCACCGC
GAGCGACCGG CTCGCGACGA CAAGGTCGTG GCTGCCTGGA ATGGCCTGGC GATCAGCGGC
CTGTGCCGGG CCGGCGCCCT GATCGGACTC CCGGAGTACG TCGCCGCGGC GACCGCCGCG
GGCCAGCTCC TCTGGCGGGT CCACCTGGTC GACGGCCGGC TGCGGCGGGT CTCCCGCGAC
GGCGTGGTCG GCGCGCCCGC CGGCGTGCTG GAGGACAACG GCTGCGTCGC GGCCGGCTTC
CTCGACCTGC TCCAGGCCAC CGGCGACGCC GTGTGGCTGG AGCGCGCGGG CGCCATCCTG
GAGCTCGCGC TCACCCACTT CGCCGCCGAG GACGGCGGCT TCTTCGACAC CGCCGACGAC
GCGGAGGCCC TGGTCGCCCG GCCGCGCGAC CCCTCCGACA ACGCCAGCCC CTCGGGCCTC
GCCTCGATGG TGCACGCGCT GTCGACGTAC GCCGCGCTCA CCGGCTCGGG CCGCCACCGC
GACGCGGCCG AGGCGGCGCT GGCCTCGGTC GCCACCCTCG CGGAGCGGGC GCCCCGCTTC
GCCGGCTGGT CGCTCGCCGC CGCCGAGTCG ATGCTCGACG GGCCGGTGGA GATCGCGATC
GTCGGCGACT GGTCCGAGCA GCGCGACCAG CTCGAGGCAC GGGCCCGGCG GGAGCCCGGG
GCCGTCGTCG TGGTCGCGGA CCGGGCCGAC GAGGCGATCC CGCTGTTGGC CGGGCGCACG
CCGGTGGACG GTCGCGCGGC GGCGTACGTG TGCCGCCACC TGGTGTGTGA GCGCCCGGTC
AGCACCGTCG AGGAGCTGGA CGAGGCCCTG TCCCGCTGA
 
Protein sequence
MVNRLATATS PYLLQHAQNP VDWWEWGPEA FEEARRRGVP VLLSVGYAAC HWCHVMAHES 
FEDEATAAYL NEHFVSVKVD REERPDVDAV YMQATTSMTG HGGWPMTVVL DHEGSPFFAG
TYFPDRPRHG QPAFRQVLEA LADAWQNRSD EVRRVAANLR EHLSSTSLAT AGAPITRAVL
DGAVRTLALE YDADAAGFGG APKFPPSMVL EFLRRHGERE MLGATLEAMA RGGIHDQLGG
GFARYSVDTD WVVPHFEKML YDNALLLRVY AEWDTPVGVW AAEGIADFLL GELRTPEGGF
ASALDADSEG AEGTYYVWTP AQLTEVLGPE DGPWAARLLG VTDAGTFEHG TSTLQLRQDP
DDLDRWFDCQ RRLREARSHR ERPARDDKVV AAWNGLAISG LCRAGALIGL PEYVAAATAA
GQLLWRVHLV DGRLRRVSRD GVVGAPAGVL EDNGCVAAGF LDLLQATGDA VWLERAGAIL
ELALTHFAAE DGGFFDTADD AEALVARPRD PSDNASPSGL ASMVHALSTY AALTGSGRHR
DAAEAALASV ATLAERAPRF AGWSLAAAES MLDGPVEIAI VGDWSEQRDQ LEARARREPG
AVVVVADRAD EAIPLLAGRT PVDGRAAAYV CRHLVCERPV STVEELDEAL SR