Gene Noca_1807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1807 
Symbol 
ID4597644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1920745 
End bp1922211 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content72% 
IMG OID639776406 
Producthypothetical protein 
Protein accessionYP_923006 
Protein GI119716041 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.932272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCCCGG ACGGACGAGT GACCCACATC GAGCCGGACG TGTTCGTGCG TTACCTCGAG 
CGCACCCGGT GGTTCGGCGG CAAGGGCCGC CCCTTCGAGG TCGCCTCCGT GCGCCGGATC
GGCGAGGTGC CCCGCGACGA GGAGGACGGC GGCCCCCGTG TCGTCATCGA GCTGATCGAG
GTCGCCTACA GCGACGGCCC GGGCGGCACC GAGGTCTACC AGGTCCCGAT GTCGTTCTAC
ACCGAGCCGG AGAGCCGGCT CGACCACGCG TTCATCGGCT GGTGGGACGA GCCCGGCTAC
GGCTGGGTGC ACGCGTACGA CGCCCTCCAC GACCGCGACG CGATGGACGG CTGGCTGCGT
GCCTTCGACC GGGCGGCCCG CGAGCCGGGC GGCAACCTCA GCGACGACGA CAGCGGGCTG
CGGTTCCACC GCCTGCCCGG GCACGACCTC GACCTCGACG CGCACTCGAC GCTGTTCTCC
GGTGAGCAGT CGAACTCCTC GGTGGCGTTC GGCGAGGACT GCCTGATGAA GGTGTTCCGC
AAGATCACCC CCGGGGTGAA CCCGGACATC AGCGTCCACG AGGTGCTCAC CAACGCCGGT
TCCGACCACA TCGCGGCGCT GTACGGCTGG CTGGACTGGG TCGACTGGGA GGCCGACCCC
GAGGCCGAGG AGCGCGCCGG CACGACGATG CAGCTGGCCA TGCTCCAGCA GTTCCTGCGG
ACCGCGAGCG ACGGCTGGGA CCTGGCGCTG ACGAGCGCCC GCGACCTGTT CGCCGAGGCC
GACCTGCATG CCCACGAGGC CGGCGGAGAC TTCGCCGCCG AGGCGGCCCG GCTGGGGACC
GCGCTGCGCG AGGTCCACGA GGACCTCGCC GAGCACTTCC CGGTCGAGCG CCGCGGCCCC
GAGGCGCTCA CCGAGCTGGC CGACGCGATG TCGGCGCGGC TCGACGCCGC GCTCGAGGTG
GTGCCCGAGC TCGCCGCGCA CACCGAGACG CTACGGGCGA CGTACGACCG CGTCCGTGGA
CTCGGCGGAC TGGAGGTCCA GCAGATCCAC GGGGACCTGC ACCTGGGCCA GACGCTGCGC
ACCAGCCTGG GCTGGAAGAT CGTCGACTTC GAGGGCGAGC CGGCCAAGCC GCTCGCCGAG
CGGCTGCGAC CGGACTCGGT GTGGCGCGAC GTCGCGGGCA TGCTGCGCTC CTTCGACTAT
GTGCCCCGGG TGGTCGAGCG CCAGTTCGCC GAGGACCAGC CCGAGGGCGC CAGCCAGCGC
GCCTACCGCG CGGAGGAGTG GGCGCACCGC AACCGCAACC ACTTCCTGAC CGCGTACGCC
GGCGGCGAGC TCACCGAGGA GCAGCAGGCG CTGCTCGACG CCTATGTCGT GGACAAGGCG
GTGTACGAGA CCGTGTACGA GACACGAAAC CGTCCGACCT GGGTGGCCAT CCCGCTCGAG
GCCGTGGCGA GGATCGGAGC GGCATGA
 
Protein sequence
MTPDGRVTHI EPDVFVRYLE RTRWFGGKGR PFEVASVRRI GEVPRDEEDG GPRVVIELIE 
VAYSDGPGGT EVYQVPMSFY TEPESRLDHA FIGWWDEPGY GWVHAYDALH DRDAMDGWLR
AFDRAAREPG GNLSDDDSGL RFHRLPGHDL DLDAHSTLFS GEQSNSSVAF GEDCLMKVFR
KITPGVNPDI SVHEVLTNAG SDHIAALYGW LDWVDWEADP EAEERAGTTM QLAMLQQFLR
TASDGWDLAL TSARDLFAEA DLHAHEAGGD FAAEAARLGT ALREVHEDLA EHFPVERRGP
EALTELADAM SARLDAALEV VPELAAHTET LRATYDRVRG LGGLEVQQIH GDLHLGQTLR
TSLGWKIVDF EGEPAKPLAE RLRPDSVWRD VAGMLRSFDY VPRVVERQFA EDQPEGASQR
AYRAEEWAHR NRNHFLTAYA GGELTEEQQA LLDAYVVDKA VYETVYETRN RPTWVAIPLE
AVARIGAA