Gene Noca_2241 details


Gene Information

Locus tag: Noca_2241
Symbol:
ID: 4598739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2387334
End bp: 2389304
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 73%
IMG OID: 639776840
Product: hypothetical protein
Protein accession: YP_923433
Protein GI: 119716468
COG category:
COG ID:
TIGRFAM ID:
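
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2387334, 2389304)
print(length)                           # 1971
print(expected_protein_length(length))  # 656
```

Both results agree with the Gene Length and Protein Length fields of the record.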


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCACCG TCATCACCGC CGTGCGCCGG GCGAGCCGAC TCGGCATCGC GGCCCTGGCG 
CTCCTGGTGC TCACCACGCT CCTCGGAGCT GCCTCGACCC TCCAGCCTCC CGCCGCAGCG
GCCAGCGAGC CGGCCGCCAG CACGGCGCAC CTCGCTCCAG TCGCCCCGGC GGCGACCAGC
GGCACTCAGC CGACCCGCAA CCTCAAGATG GGCTGCCCGG GGCCCGACGC CCTATGCGAC
CTCGGCAGCG ACGCCGTCGA CTGCGCCAAG GACCCGATCG ATTGCGGCAA GGACGCCGCC
GGCGATGTGA AGGACGGTGC CGGCGACCTG CTCGACGGCG CAGGAGATCT GCTCCCCGAC
GGCTGCGGGA TCCTCGACGC GATCTGCGGC AACATCGGCG GACTGCCCGG GCTTTCGGGC
GTCCCTGGGC TGCCCGGGAT CCCCGGTCTG CCGAACGTCG GTGACCTCTT CGGCGGCGGC
ATCCCCGGGC TGGGCGACAT CCCCAACCCG TTCGAGGCCA TCGGCGACGT CATCGCCAAG
GCCGCGGCCG ACGCCTGGAC CGCGGCCATG CTCGCGATCT GGAACTCCGG CCTGTTCGTG
CTGCGCATCG TGCTCACGTT CAGTGAGCTG TTCTTGACTC CGGACCTGAG CGCCGACGGC
CCGGGCAAGG ACGTCTACGC CTTCACCCTG TGGCTGGCGC TGGCCCTGGT GGTCATCTTG
GCGATGATCC AGCTCGGCGC CGCCGCCTTC AAGCGCGAGG GCAAGAGCCT CGCCCGGGCC
TTCATCGGGT CCGGCCAGTT CGTCTTGGTG TGCGCCAGCT GGTTCGGGTA CTGCGTCATG
ATCATCGCGG CCTGCGGGGC GCTGACCAAG GCGCTGATGA AGTCGCTGCT CAAGGTGCAG
ACCTGGCCCG ACTGGGACCC GCTCGGCGGA CTCGGCATCG ACGACATCAC CGACGCCGGC
GTGGCCACCG CGCTGGCATT CCTCGGGATC TTCCTGTGGC TGGCCGCCAT CGGGCACGTC
CTGGTCTACC TGGCCCGCGC GGCGTCCCTG CTGGTGCTCA CCGCCACGGG GCCGCTCGCG
GCCGCCGGCC TGGTCTCGGA GTTCACCCGC TCCTGGTTCT GGAAGTCGCT GCGCTGGTTC
CACGCCGCGG CGTTCACCCC GGTGCTGATG GTGATGGTGC TGGGCATCGG CGTGCAGTTC
GCCAACGGAG TCGCCGCCCA CCTAGCCGAG GACACCGCCA AGGCGTTCGG CACCGCGCTG
CCGGCCGTGA TGACGATCCT GATCAGCGTC GTCGCCCCGC TGTCCCTGTT CAAGCTCCTT
GCCTTCGTCG ACCCCGGCAC CCCCAGCGGC GCGTCCTTTC GCCAGGGCAT GGCCATCCAG
GGCGGCCTCC AGGGCCTGCT CAGCGGCGGC GGCGCGGGCG GAGGCTCGTC GGCTGCGTCG
ACCACCGACG CCAACGGCCG CTCCTCGGGC GAGCAGAGTG CCGAGGCCTC GACCGGCGAC
CGGTTCAGCA AATCGACCCA GGGCGCCCTG GGCAGCTTCG GCCCGGTGGG CCAGGCCCTG
TCGACCGGCA TGGGCTGGAT CAACTCCGCC GGCGCGAAGG CGACCTCGCT GATGTCGGAC
GAAACCAACC AGGCCGGTGT CGGCCAGAGC ACCTACGGCC CCGACTTCAG CGGCCTGAGT
GGACGGCAGT CCGGTGGCCA GTCCGGCGGC CAGGGCGGGA CCCACCCCGG GTCGCAGAAC
GGCGACCAGA GCGACGGGGA TTCGTCGATG CCGACCCCGC CCACGCCTCC TGCGCCGCCC
ACCCCGCCGA CGCTGCCCAC TGGCGGCGGA CCCGGCGGCG GTTCAGGTGG CCAGGGCGGC
AGGGGAGCTG ACGCAGCTCC CAAGACCCCG GCCGCCGGCG GCGGGGGAGC AGCAGGAGGT
GCCGGCGGCG CCGGCGCGGC CGCTGGCGGC ATTCCACCGG TGGCGGGGTA A
 
Protein sequence
MSTVITAVRR ASRLGIAALA LLVLTTLLGA ASTLQPPAAA ASEPAASTAH LAPVAPAATS 
GTQPTRNLKM GCPGPDALCD LGSDAVDCAK DPIDCGKDAA GDVKDGAGDL LDGAGDLLPD
GCGILDAICG NIGGLPGLSG VPGLPGIPGL PNVGDLFGGG IPGLGDIPNP FEAIGDVIAK
AAADAWTAAM LAIWNSGLFV LRIVLTFSEL FLTPDLSADG PGKDVYAFTL WLALALVVIL
AMIQLGAAAF KREGKSLARA FIGSGQFVLV CASWFGYCVM IIAACGALTK ALMKSLLKVQ
TWPDWDPLGG LGIDDITDAG VATALAFLGI FLWLAAIGHV LVYLARAASL LVLTATGPLA
AAGLVSEFTR SWFWKSLRWF HAAAFTPVLM VMVLGIGVQF ANGVAAHLAE DTAKAFGTAL
PAVMTILISV VAPLSLFKLL AFVDPGTPSG ASFRQGMAIQ GGLQGLLSGG GAGGGSSAAS
TTDANGRSSG EQSAEASTGD RFSKSTQGAL GSFGPVGQAL STGMGWINSA GAKATSLMSD
ETNQAGVGQS TYGPDFSGLS GRQSGGQSGG QGGTHPGSQN GDQSDGDSSM PTPPTPPAPP
TPPTLPTGGG PGGGSGGQGG RGADAAPKTP AAGGGGAAGG AGGAGAAAGG IPPVAG
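
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is one possible way to do that and to compute a GC fraction comparable to the GC content field. The helper names clean_sequence and gc_fraction are hypothetical, and the example uses only the first line of the gene sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "GTGAGCACCG TCATCACCGC CGTGCGCCGG GCGAGCCGAC TCGGCATCGC GGCCCTGGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to the complete gene sequence would give a value to compare against the reported GC content.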