Gene Noca_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1100 
Symbol 
ID4599580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1159085 
End bp1160236 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content68% 
IMG OID639775696 
Productphage integrase family protein 
Protein accessionYP_922303 
Protein GI119715338 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGGAA ACATCGCCAA GCGAGCCAAC GGCAAGTGGC GTGCGCGGTA CCGCGACGAG 
GCCGGCAACG AACGCGCCCG GCACTTCGAC CGCAAGATCG ACGCCCAGCA GTGGCTGGAT
CAAGTCACCT CGGCGGTAGT CACCGGCACG TACGCCGACC CCAAGGCCGG CCGGATCACG
TTCGCGGCCT TCTTCGGCGA GTGGTCGGCC CGCCAGGTCT GGGCACCCGG CACCGTGCTC
GCGATGTCAC TGGCGGCGAG ATCCGTGCCC TTCGCGGGGA AGCCGATGAA GCAGGTCCGG
CGCTCGGACG TCGAGACCTG GATCAAGCAG ATGAACGCCG CCGGACTCGC CCCCGGCACG
ATCAAGACGC GCTACGTCAA CGTCAGATCA GTGTTCCGAG CCGCCGTGAA GGACCGGGTG
ATCGGCTCCG ACCCGACCGA CGGCGTACGC CTTCCCCGCG GCCGTCGCGC GGACGTCGGC
ATGTCGATCC CCGCGCCGGA GGAGGTGAGG CAGCTCATGG CCGTGGCTGA CGAACGCTTC
CAGCCGTTCA TCGCCCTCTG CGCCTTCGCC GGGCTGCGGT TGGGTGAGGC CGCCGGGGTC
CAGCTCGGCG ACGTCGACTT CCTCCGCAGG TCGCTGAAGG TCTCCCGCCA GGTGCAGCGC
GTCAATGGTG GGGCGATTGA CGTACGGGCA CCGAAGTACG GCTCAGAGCG CGTCGTCTAC
CTCGCCGACA GTCTCGTCAA CGTGCTCGCC GAGCACGTCG GCGCTCACGG CACCACCGGC
AAGGCTCGGT GGCTCTTCGC CGGGGAGGGC GACGACCCAC CGCACCAGAA CACCATCGGC
TACTGGTGGC GGAAGACGCT GCGCGACGCC GGCCTGTCCG GCATCAAACT CCACGACCTG
CGGCACTTCT ACGCCTCCGG GCTCATCGCG GCCGGGTGCG ACGTTGTGAC CGTCCAACGA
TCGCTCGGGC ACGCGAAAGC GACTACGACG CTCAACACCT ACGCACACCT CTGGCCGACC
GCTGAGGACC GCACACGTAA GGCTGCGGAG TCGATCATGG CCGCGTCGCT GGGCAAGCCG
GCCGCGATCC TCGCCGAGGT TGGAGGCGAG TACGGGTCAG TGAGCCATGC ATCTGATCGC
AGATGTACTT GA
 
Protein sequence
MAGNIAKRAN GKWRARYRDE AGNERARHFD RKIDAQQWLD QVTSAVVTGT YADPKAGRIT 
FAAFFGEWSA RQVWAPGTVL AMSLAARSVP FAGKPMKQVR RSDVETWIKQ MNAAGLAPGT
IKTRYVNVRS VFRAAVKDRV IGSDPTDGVR LPRGRRADVG MSIPAPEEVR QLMAVADERF
QPFIALCAFA GLRLGEAAGV QLGDVDFLRR SLKVSRQVQR VNGGAIDVRA PKYGSERVVY
LADSLVNVLA EHVGAHGTTG KARWLFAGEG DDPPHQNTIG YWWRKTLRDA GLSGIKLHDL
RHFYASGLIA AGCDVVTVQR SLGHAKATTT LNTYAHLWPT AEDRTRKAAE SIMAASLGKP
AAILAEVGGE YGSVSHASDR RCT