Gene Noca_2030 details

Gene Information

Locus tag: Noca_2030
Symbol:
ID: 4598652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2174177
End bp: 2175616
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 68%
IMG OID: 639776634
Product: integrase catalytic subunit
Protein accession: YP_923227
Protein GI: 119716262
COG category: [L] Replication, recombination and repair
COG ID: [COG2826] Transposase and inactivated derivatives, IS30 family
TIGRFAM ID:
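
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2174177
end_bp = 2175616
protein_length_aa = 479

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Each codon is three bases; the final codon is assumed to be a stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp)       # 1440
print(remainder)            # 0
print(expected_protein_aa)  # 479
```

Under these assumptions the computed values agree with the listed Gene Length (1440 bp) and Protein Length (479 aa).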


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACGA AGGACTGGAG CAAGAAGACC AGCGATGCGC CGGAGGGGCT TCGTCGGCAG 
TGGCGTGCTG ATCGGGCGCT GAGGCCGGCG ATGCGCTCGC CCGGGCGACC GGACCCGTCG
CGGGTGGTGC AGCGACAGTT CTGGCGGCAG ATCGCCACGG GCGTCACGAC GGTGGAGGCG
TCGATGGCCG TGGGCGTGTC GTGGCCGGTC GGTGCTCGCT GGTTTCGCCA CGCTGGCGGC
ATGCCGCCGA TCTCGCTGGC CGAGCCCACC GGCCGCCACC TGACCTTCGA GGAACGCGAG
GAGATCGCGA TCCTGCGCGC CAAGGACAAG GGCGTGCGCG AGATAGCCCG TGCGATCGGG
CGTGACCCGG GGACCGTCTC ACGCGAACTT CGTCGCAATG CAGCGACTCG TGGCGGCAAG
CAGGAGTACC GCGCTGGCGT AGCGCAGTGG AAGGCACAGC AGGCGGCGAA GCGTCCCAAG
ACCGCGAAGC TCGTGACCAA CGAGCGGTTG CGTGAGTACG TGCAGGATCG GCTCGCCGGC
AACGTCCGCC GTCCCGACGG CACGATCGTG CCGGGTCCGA CACCGCCGCC GTGGAAGGGC
CTGAACAAGC CGCATCGCCG GGACAGGCGG TGGTCGACGG CTTGGAGCCC GGAGCAGATC
GCCCAGCGCC TGAAGGTCGA GTTCCCCGAT GATGAGTCCA TGCGCATCAG CCACGAGGCG
ATCTACCAGT CGCTGTTCAT CGAGGGCCGC GGTGCGCTCA AGCGCGAACT GGTCACCTGT
CTGCGCACCG GGCGTGCGCT GCGGGAGCCG CGGGCCCGGT CACGGAACAA GGCACAGGGG
CACGTGACCG CCGATGTCGT TCTCAGCGAG CGCCCCGCTG AGGCAGACGA CCGGGCCGTC
CCTGGCCACT GGGAGGGCGA TCTGATCATC GGCACGGGTC GGTCTGCGAT CGGCACCCTC
GTCGAGCGCA GCAGTCGCTC AACGCTCCTG GTCCATCTGC CGCGACTGGA GGGCTGGGGT
GAGAAGCCGT ACGTCAAGAA CGGGCCATCA CTCGGTGGCT ACGGGGCCGT CGCGATGAAC
ACCGCGCTGA CCGCGTCGAT GACCAAGCTG CCCGAGCAGC TGCGCAAGAC CCTGACGTGG
GACCGTGGCA AGGAACTCTC GGGCCACGCG TTGTTCGCGG TGGCGACCGG CACGAAGGTG
TTCTTCGCCG ACCCGCACTC GCCGTGGCAG CGACCGAGCA ACGAGAACAC CAACGGCCTG
TTGCGCCAAT ACTTCCCCAA GGGCACCGAC CTGTCGCGCT GGTCCGCCGA GGACCTGGAG
GCCGTCGCCT ATGCGCTCAA CAACCGGCCC CGCAAGGTCC TCGGGTGGAA GACACCCGCT
GAGGTCTTCG AGGAGCAACT ACGCTCCCTT CAACAACCCG GTGTTGCATC GACCAGTTGA
 
Protein sequence
MATKDWSKKT SDAPEGLRRQ WRADRALRPA MRSPGRPDPS RVVQRQFWRQ IATGVTTVEA 
SMAVGVSWPV GARWFRHAGG MPPISLAEPT GRHLTFEERE EIAILRAKDK GVREIARAIG
RDPGTVSREL RRNAATRGGK QEYRAGVAQW KAQQAAKRPK TAKLVTNERL REYVQDRLAG
NVRRPDGTIV PGPTPPPWKG LNKPHRRDRR WSTAWSPEQI AQRLKVEFPD DESMRISHEA
IYQSLFIEGR GALKRELVTC LRTGRALREP RARSRNKAQG HVTADVVLSE RPAEADDRAV
PGHWEGDLII GTGRSAIGTL VERSSRSTLL VHLPRLEGWG EKPYVKNGPS LGGYGAVAMN
TALTASMTKL PEQLRKTLTW DRGKELSGHA LFAVATGTKV FFADPHSPWQ RPSNENTNGL
LRQYFPKGTD LSRWSAEDLE AVAYALNNRP RKVLGWKTPA EVFEEQLRSL QQPGVASTS
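
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in these assignments). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and are not part of any tool used to produce this record.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 bases).
first_line = "ATGGCGACGA AGGACTGGAG CAAGAAGACC AGCGATGCGC CGGAGGGGCT TCGTCGGCAG"
print(translate(first_line))  # MATKDWSKKTSDAPEGLRRQ
```

The printed peptide matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.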