Gene Noca_1123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1123 
Symbol 
ID4599376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1188975 
End bp1190732 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content73% 
IMG OID639775719 
Producthypothetical protein 
Protein accessionYP_922326 
Protein GI119715361 
COG category 
COG ID 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.12304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTCGAT CAGCTGACCG CCGCCGCGAT GACGGGTTCA CCCTCATCGA GATCATCGTC 
GCGCTCGGCG TGATCATGAC GGTGATGGCC GCCGTGCTGC CCCAGCTGCT GGTCGGCATC
AGGTCCGGCG AGACCTCCCG ACTGGTGACC CAGACCAAGG GCGTCGCCCA GGGCCAGCTC
GAGCGGATGC GCAACCTGCC GTATCACGTC GCCCCCGAGG CCGGCGACTA CCGCGACGTG
CTCGACTACT ACTTCCGCAA CCTCACCACC CCCGGACCGA TCACCTGCAC CGACCCGGAC
GGGCTCGCGA TGCCGACGAC CGCGTGGACC GGCTACGTCC CCGCCGACGG CGCCCGCTGC
TCCTATGAGC CGCAGTCCGG GGAGATGTAC CGCTACGTGG TCCCGCACCC GGCCACCGGG
ACCGATCCGC TCGCGGGCTT CCAGGTCGTC GTCGACACCC AGTTCCTCTC GGCGCCGAAG
TCCGACGGGA GCTCCGACGT GCTCGCGCCG CCCTCGGGCT ACAACACCCA GAGCGCGGGG
CACGACAGCC CGGTCTCCTC CCAGATCGGC GTCACCGTGA CCGTCCTCTA CGACCGGCAG
GGCATCACCC GGCCGGTCAC GACGTACTCC CAGATCGCCG ACCAGCCGGT CGCGGCCAGG
CGGATCGACC TCAGCGCGTC CGCGGCCGCG GTCGACATCG GCTCGATCAC GCCGACGAAC
GGCGCCGAGT CACTGCAGGC CGGGCTGCTC AGCCTCTCCG GTGCGCTGAC CTTCGCGAGC
ACGGCCAACG CCAGCCTGAC GGCGGCGTCC GCCGGGCTCG CGACCGGGGA ACAGGGGGCA
GGCGCCTCGA CGACCGTCGC GGCGCCCTCC GCCGTGGGCA TCCTGCCCGC CGCTGCCGGC
GGGATCGACG GGACCTGCGG GCTCGCGTGC TGGGGCGCCA GCCAGGTCGA CCTCGGCGCC
GTGACCGCGA CCGACGGGCT CCCGAACGTC GGCTCGGCCG CCAACCCGAT GCAGGCCCGG
CTCACCGACC TGACCAACCT GGGACTCTCC TTCGAGAGCG GCGCTGCCGC GGACTACCGG
ACCGGGCTGG GCCTCAGCCG TCCGCTGGTC CGTCTCGACG CCGGCGCCAC GGCCACGAAC
AGCGGGGTCA GCGCGACCTG CGCGCCCTCC GGCACCGGGG CACCGGCCCT GGTCCGGTCC
TCGGGCTTCC TGCGCACGAC CCCGATGACC GACGCCACCC CCACCGCCGA GGCCTGTGCG
GTGAGCTCGG CGAGCACGAT CTCCCTGTTC CCGACCTCGT TCGCTCCCGA CGGAGTGCTC
CAGGTGGAGC TGGTCCGCGC CACCGCCCGC TGCGTGGTCA GCGGCGCCGG CCACGTGGCG
CAGCCGCCGA CGTTCGACTA CCGGGTCGTC GTACGCCGGC ACGTCCCCGG CACCGAGGCC
GCGCCGGCCG GAGGCTACGA CGACGTGCTT GCGATCACCC CCTCGCTCAC CGCCGACGAC
CTGGCAGCGA TCGACCCGGC GTCCTTCGAC GTCGGCGGCG GCCACACCCT GGCCGACTAC
GTCGCGTCCT GGTCGGCGCT GGTGCCGGGC ACCGTCGAGA CGACCGCCGC GAACGGGCTG
TCCGCGGTGA CGCTGTCCGG GGTCCTCAAG CTCACCAGCC AGCCGATGCG GGTGCTGCCC
GACAGCACCG TCGACCCGGC CTCGGCCGTC TCGCTCACCC TCGGCCAGGT CGGCTGCTCG
GCCCTGGACG CGCGATGA
 
Protein sequence
MSRSADRRRD DGFTLIEIIV ALGVIMTVMA AVLPQLLVGI RSGETSRLVT QTKGVAQGQL 
ERMRNLPYHV APEAGDYRDV LDYYFRNLTT PGPITCTDPD GLAMPTTAWT GYVPADGARC
SYEPQSGEMY RYVVPHPATG TDPLAGFQVV VDTQFLSAPK SDGSSDVLAP PSGYNTQSAG
HDSPVSSQIG VTVTVLYDRQ GITRPVTTYS QIADQPVAAR RIDLSASAAA VDIGSITPTN
GAESLQAGLL SLSGALTFAS TANASLTAAS AGLATGEQGA GASTTVAAPS AVGILPAAAG
GIDGTCGLAC WGASQVDLGA VTATDGLPNV GSAANPMQAR LTDLTNLGLS FESGAAADYR
TGLGLSRPLV RLDAGATATN SGVSATCAPS GTGAPALVRS SGFLRTTPMT DATPTAEACA
VSSASTISLF PTSFAPDGVL QVELVRATAR CVVSGAGHVA QPPTFDYRVV VRRHVPGTEA
APAGGYDDVL AITPSLTADD LAAIDPASFD VGGGHTLADY VASWSALVPG TVETTAANGL
SAVTLSGVLK LTSQPMRVLP DSTVDPASAV SLTLGQVGCS ALDAR