Gene Noca_2973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2973 
Symbol 
ID4595611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3158470 
End bp3160863 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content68% 
IMG OID639777578 
Productphage integrase family protein 
Protein accessionYP_924162 
Protein GI119717197 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACCAGCG AGCAGCTGAA GGTCGCCGGG CCGTACTACC GCGAGCATCC CGAGCTGGTG 
GACCAGGTCT TCGCCGGGCC ACACGACACC GCGGTGAAGG ACGAGGTCTT GCGGCTGGTG
GCCAAACTCC CGGACTGGCC GGTGACCGAG TACGCGAGCA ATCGGCTGCT GGAAGGGGCA
GATTGGGTGC TCGACTGGCT GCTCATCTTC CCCGGGGCCG GTTGGCGCGA CCGATGGGTT
GCGGCCGGTG CCGACACGAA CCCGCTGGGC TGGCTGCCCG AGCGCCACGG CGAGGACCAC
CGCGACCCGA AGACGGTGCG CCAGATCGCA GTCGCAGGGA TGCGCACCCT GATCGTGTCG
CGGGTGGTCC TGCCCGGCCA CGAGTTCTTC GCGACCTGGA AGACCAAGGG CTATCAACAG
GTCCTGGTGC AGCAAGATCC CGCCCTAATC GCCCGGATCG AGGCGTTCGC CGAGCAGCAT
CAGATCAGCC GTCGCCAACA GCGCGACGTC TGGACGGTAC TGGCCAGGAT CATGCTCCAC
ACCGGCAAAG ACCTCCAGGA CGTCAGCGCC GAGGACCTCT TCGAGCTGCG GGCCCACTAC
CGGGAGAAGC ACGGAGTCCC CGCCGCCGGA CTGGGACAGA CCTGGACCCT GCTGGCCGGG
GTCGGCATCT TGCCCAAGGA CTCCTCCCTG CGTGCCGCGC TGCGCGTGGG GCAACGCACC
CCCGCCGAGC TGGTCGACCG CTACGGGCTC GCTCCCGGGC CCGTGCGTGA CCTGCTGGTG
CGCTACCTGG CCGAACGGAA GACCAACGTC GACTACACCA CCTTGAAGTC GCTCGCGCGG
ATGCTGGCCG GCAACTTCTG GGCCGACCTG GAGCGCCACC ACCCCGAACT CGCCGGGAGC
GACACCCTGG CCCTGCCCAA GACGACGGTG ACCGCCTGGA AGGAGCGCCT GGCCGTCATC
GTCGCCCCCG ACGGGTCGAG GAGGCCGCGA GCCGACTTCC TCGACGTGCT GCTCACCGTG
CGCTGCCTCT ACCTGGACCT ACGTGACTGG GCGCTCGATG ACGCCACGCT CGCACCATGG
GTGGTGGCCT CACCAGTCAC CCGCGCCGAC GTGGCCGGCA ACAAGAAACG GAAGCTGGCC
CATCAAGCCC GCATCCACCA GCGCATCCGC GAACGAGTGC CGCTGGTCCC GCGGATGCTC
GCCAGGCTGG AACAGGACCG CCGCGACGCC GAGGCGATGC TGGCCCTGGC CAAGCAGACA
CCTGCCGATG CCTCATTCAA CTTCAACGGT CGCACCTATC AGCGGATCAG CACCCGGATC
CGTGGCGGCC GCGGCCTACC TCCCGACCTG GCCTACGACG TGGTCTGCGT TGAGACCGGC
GAACGCTTCA GCGTCGGCCG GACCGAGGAG GACGCATTCT GGCTGTGGGC GGTCGTGGAG
ACGCTGCGAC ACAGCGGTGT GCGCCTGGAG GAACTCCTCG AGCTCACCCA CCTTGCGCTG
GTGTCTCACC GGCTGTCCAC CACCGGCGAG CTCGTCCCGC TGCTGCAGAT CGTCCCGTCG
AAGACGAACG AGGAACGACT GCTCCTGGTC ACCCCCGACC TTGCCCGCGT CCTGGCCGAG
ATCATCAGCA GACTGCGCAA CCGCTACAAC GGAGCCGTAC CGCTGGTCGC GCGCTACGAC
AACTACGAAG CCACCACCGG GCCACCACTG CCGCACCTGT TCCAGCGTGA CTACGGCTCC
GGTCCGACGG CGATGAGCCC AACCATGGTC CGAGTCACCC TGCGGGAGGT GGCCGACCGG
ATGGGCCTGA CCGACACCGC AGGCAACCCC ATGCACCTGA CCCCGCACGA CTTCCGACGG
GTCTTCACCA CCACTGCCGT CCAAGACGGA CTGCCGCTGC ACATCGCCTC GCGGATCCTC
GGGCACCGCC ACCTCAACAG CACCCAGCCC TACACCGCCG TCTTCCAAGA CGACCTAATC
CGCACCTTCC GAACCTTCGT CGATCGCCGC CGATCAGCGC GACCGGCCGA GGAATACCGC
GAGCCCACCC CCGAAGAGTG GGACGAGTTC GAGGAACACT TCGCCCTGCG CCGTGTCGAG
CTCGGCTCCT GTGCCCGGCC CTACGGGTCG AGCTGCGAAC ACGAGCACGC CTGCATCCGC
TGCCCCGTCC TGCGTGTCGA CCCCGCCCAG CTCGACAGAC TCGAAGCCAT CGCAGCCAGC
CTCCTACAAC GCATCACAGA AGCCGAGGAA CGCGGTTGGC CCGGTGAGGT CGAACAACTC
ACGATCAGCC TGCGCGCCGC CCAAGACAAG ATCTTGGCCA CCGCACCACC TCGCAAGGGC
AGCGTGTTCC TGGGAGATCC AGCCATACCC CCGACGAACG CGCCGCAAGC CTGA
 
Protein sequence
MTSEQLKVAG PYYREHPELV DQVFAGPHDT AVKDEVLRLV AKLPDWPVTE YASNRLLEGA 
DWVLDWLLIF PGAGWRDRWV AAGADTNPLG WLPERHGEDH RDPKTVRQIA VAGMRTLIVS
RVVLPGHEFF ATWKTKGYQQ VLVQQDPALI ARIEAFAEQH QISRRQQRDV WTVLARIMLH
TGKDLQDVSA EDLFELRAHY REKHGVPAAG LGQTWTLLAG VGILPKDSSL RAALRVGQRT
PAELVDRYGL APGPVRDLLV RYLAERKTNV DYTTLKSLAR MLAGNFWADL ERHHPELAGS
DTLALPKTTV TAWKERLAVI VAPDGSRRPR ADFLDVLLTV RCLYLDLRDW ALDDATLAPW
VVASPVTRAD VAGNKKRKLA HQARIHQRIR ERVPLVPRML ARLEQDRRDA EAMLALAKQT
PADASFNFNG RTYQRISTRI RGGRGLPPDL AYDVVCVETG ERFSVGRTEE DAFWLWAVVE
TLRHSGVRLE ELLELTHLAL VSHRLSTTGE LVPLLQIVPS KTNEERLLLV TPDLARVLAE
IISRLRNRYN GAVPLVARYD NYEATTGPPL PHLFQRDYGS GPTAMSPTMV RVTLREVADR
MGLTDTAGNP MHLTPHDFRR VFTTTAVQDG LPLHIASRIL GHRHLNSTQP YTAVFQDDLI
RTFRTFVDRR RSARPAEEYR EPTPEEWDEF EEHFALRRVE LGSCARPYGS SCEHEHACIR
CPVLRVDPAQ LDRLEAIAAS LLQRITEAEE RGWPGEVEQL TISLRAAQDK ILATAPPRKG
SVFLGDPAIP PTNAPQA