Gene Noca_2235 details

Gene Information

Locus tag: Noca_2235
Symbol:
ID: 4598733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2381634
End bp: 2382968
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 67%
IMG OID: 639776835
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_923428
Protein GI: 119716463
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
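
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(2381634, 2382968)
print(gene_length)                           # 1335
print(protein_length_from_cds(gene_length))  # 444
```

Both values match the Gene Length and Protein Length fields listed in the record.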


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGATTGGT GCCAGCGAGC CCGAGCGTCA GACTTTCGAA TCTCCCTGCC CTCCTCACGA 
CGGACGCCTG GATCCGATCG CGAGGAGGAG ATGGCCATGG AAGTGATCCA CGTCCGGTGT
GCGGGCATGG ACGTGTCGAA GAAGGACGCC AAGGTCTGCG TCCGACATGC GGGAGCAGGT
CGGCGCAAGA CCGTGGAAAC GGTCACGACC TGGACCTCAA TGACCGGCCA GATCCTGGCG
CTGCGCGAGC ATCTGATCGC CGAGCAGGTC ACGTGTGTGG TGATGGAGGC AACCGGTGAC
TACTGGAAGC CGTTCTACTA CCTGCTCGAA GACCTGCCCG GTGTCGAGGT GATGCTGGTC
AACGCCCGCC ATGTCAAGAC CCTGCCGGGA CGCAAGAGCG ACGTCGCCGA CGCGACCTGG
CTGGCCCAGC TCGGTGCGCA CGGCCTGGTC CGGGCCTCGT TCGTGCCACC CGAACCGATC
CGGCAACTGC GGGACCTGAC CCGGGCACGG ACCGCGATCA CCCGCGAACG TGGCCGGGAG
GTCCAACGGC TGGAGAAGCT GCTGGAGGAC GCCGGGATCA AGCTGTCCGC GGTCGCCTCC
GACATCATGG GCGTCTCAGG ACGGGCCATG CTCGAAGCGC TGATCGCCGG CGACCGCGAT
CCCGCCGGGC TTGCCGACCT GGCCAGGCGT CGACTGCGGT CCAAGATCCC TGAACTGACC
GAAGCGCTCG CTGGCCGGTT CACCGAACAC CACGCGTTCC TCGCCCGGGT CCACCTGGAT
CTCATCGACC GACACACCGC CGCCGTCGAG CAGTTGACTG AGCGGATCGA GGTGGTGATC
GAGCCGTTTC AGGGCTTCCA CGACCTGATC TGCACGATCC CGGGAATCTC CACGATCACC
GCCGACATCA TCACCGCCGA GACCGGCGCG GACATGACCC GGTTCCCCAC TGCCAAGCAC
CTCGCCTCTT GGGCCGGGAC CACACCCGGC AGCAACGAGT CCGCCGGGAA GGTGAAGTCC
TCACGGACCA GGCCCGGGAA CCCCTACCTG CAGGGCGCAC TCGGGGCGGC CGCGATGGCG
TGCTCACAGA ACCGGACCAC CTACCTCGGC GCGCGCTACC GGCGGATCGC CAGCCGGCGC
GGCCCGCTGA AGGCCAACGT CGCGATCCAG CACTCCATGC TCATCGCGAT CTGGCACATG
GGCACCACCG GCACCCTCTA CGACGACCCT GGAGGCGAGT TCTTCAACCG CCTCCACCCC
GACCGCACCA AGATGCGAGC CATCAGCCAG CTCGAAGCCA TGGGCTACCG CGTCACCCTC
GACCACGCGA GCTGA
 
Protein sequence
MDWCQRARAS DFRISLPSSR RTPGSDREEE MAMEVIHVRC AGMDVSKKDA KVCVRHAGAG 
RRKTVETVTT WTSMTGQILA LREHLIAEQV TCVVMEATGD YWKPFYYLLE DLPGVEVMLV
NARHVKTLPG RKSDVADATW LAQLGAHGLV RASFVPPEPI RQLRDLTRAR TAITRERGRE
VQRLEKLLED AGIKLSAVAS DIMGVSGRAM LEALIAGDRD PAGLADLARR RLRSKIPELT
EALAGRFTEH HAFLARVHLD LIDRHTAAVE QLTERIEVVI EPFQGFHDLI CTIPGISTIT
ADIITAETGA DMTRFPTAKH LASWAGTTPG SNESAGKVKS SRTRPGNPYL QGALGAAAMA
CSQNRTTYLG ARYRRIASRR GPLKANVAIQ HSMLIAIWHM GTTGTLYDDP GGEFFNRLHP
DRTKMRAISQ LEAMGYRVTL DHAS
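
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. The sketch below is not the database's own method; it assumes the standard codon assignments used by translation table 11 and treats an alternative start codon (here GTG) at the first position as methionine. The helper names `translate` and `gc_percent` are hypothetical. Only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 shares these
# with the standard code and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_line = "GTGGATTGGT GCCAGCGAGC CCGAGCGTCA GACTTTCGAA TCTCCCTGCC CTCCTCACGA"
last_line = "GACCACGCGA GCTGA"
print(translate(first_line))  # MDWCQRARASDFRISLPSSR
print(translate(last_line))   # DHAS
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final four residues, with the terminal TGA read as the stop codon. Applying `gc_percent` to the full gene sequence would give the value to compare against the GC content field.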