Gene Noca_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1914 
Symbol 
ID4596361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2041435 
End bp2042727 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content71% 
IMG OID639776512 
ProductCBS domain-containing protein 
Protein accessionYP_923111 
Protein GI119716146 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGTCT GGCTGCTGCT GGCCGCGGCC GTCCTGGTCG GGCTGGCCGG GCTGTTCTCG 
GCGACGGACG CCGCGGTGTC GTCGTTCTCC CGGGCCCGCG CCGAGGAGCT GCTCGCGGAG
GGCCGCCCGG GATCGAAGCG GCTGGTCGCG CTGCTCGACG ACCTGCCGCG CTACCTCAAC
ACCGCGCTCC TGCTGCGCCT GCTGTGCGAG GTCTCCGCGA TCGTGCTGGT CACCCTCGAG
GCCAGCAGTG CGTACGACGG CCGCGCGTGG CCGACCGCGC TGACGGTGAT CGGCGTGATG
CTGGTGGTCT CGTTCGTCGC GATCGGCGTC GCACCGCGCA CCCTCGGCCG CCAGCACTCC
GAGCGGTTCG CGCTGCTCTC GGCCGCGCCG CTGGCCACGG TGACGGCCGT GCTGGGGCCG
CTGCCCCGGT TGCTGATCCT GGTCGGCAAC GCGCTCACCC CCGGCAAGGG CTTCCGCGAG
GGGCCGTTCT CGACCGAGAC CGAGCTGCGC GAGCTGGTCG ACCTCGCCGA GGCCTCCGCG
GTCATCGAGT CCGGCGAGCG CAAGATGATC CACTCGGTCT TCGAGCTCGG CGACACCATC
GCCCGCGAGG TGATGGTGCC GCGCACCGAT GTCGTCTACA TCGAGCGGCA CAAGAACCTG
CGCCAGACGC TGTCGCTGTT CCTGCGCAGC GGCTTCTCCC GGGTGCCGGT GATCGGCGAG
AACCTCGACG ACGTCGTCGG CATCGCCTAC CTCAAGGACA TCGTGCGCCG CGACTTCGAG
GCGCCCGACG TCGAGTTCAC CGAGCGCATC GACGAGGTGA TGCGCCCCGC GCACTTCGTG
CCGGAGTCCA AGCCGGTCGA CGGGCTGCTC TCGGAGATGC AGGCCATGCG CCAGCACATC
GCGGTCGTCG TCGACGAGTA CGGCGGCACC GCCGGACTGG TGACGATCGA GGACGTGCTC
GAGGAGATCG TCGGCGAGAT CACCGACGAG TACGACGAGG CCACCGTCGA GGTGGAGAGC
CTCGACGACG ACGCCGTGCG GGTCTCCTCG CGCTACCCGA TCGACGACCT CGACGAGCTG
TTCGGCTTCG CGGTCGAGGA GGAGGACATC GACAGCGTCG GCGGTCTGAT GGCCAAGCAC
CTGGGCCGGG TCCCGATCCC GGGCTCGGTG GTCGAGGCGC ACGGCCTGCG GTTCGAGGCC
GAGGGTGCCT CGGGTCGGCG CAACAAGATC GGCACCGTGC TGATCAGCCG GGTGGGGCCG
GTCGACGACG AGAACGAGGA GAGCGATGAC TGA
 
Protein sequence
MSVWLLLAAA VLVGLAGLFS ATDAAVSSFS RARAEELLAE GRPGSKRLVA LLDDLPRYLN 
TALLLRLLCE VSAIVLVTLE ASSAYDGRAW PTALTVIGVM LVVSFVAIGV APRTLGRQHS
ERFALLSAAP LATVTAVLGP LPRLLILVGN ALTPGKGFRE GPFSTETELR ELVDLAEASA
VIESGERKMI HSVFELGDTI AREVMVPRTD VVYIERHKNL RQTLSLFLRS GFSRVPVIGE
NLDDVVGIAY LKDIVRRDFE APDVEFTERI DEVMRPAHFV PESKPVDGLL SEMQAMRQHI
AVVVDEYGGT AGLVTIEDVL EEIVGEITDE YDEATVEVES LDDDAVRVSS RYPIDDLDEL
FGFAVEEEDI DSVGGLMAKH LGRVPIPGSV VEAHGLRFEA EGASGRRNKI GTVLISRVGP
VDDENEESDD