Gene Noca_1966 details

Gene Information

Locus tag: Noca_1966
Symbol:
ID: 4599873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2096111
End bp: 2097322
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 66%
IMG OID: 639776566
Product: phage integrase family protein
Protein accession: YP_923163
Protein GI: 119716198
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
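
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the coding sequence ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2096111, 2097322)
    print(length, "bp")
    print(protein_length_aa(length), "aa")
```

With the values from this record, the sketch reproduces the listed gene length of 1212 bp and protein length of 403 aa.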


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00610288
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCACGA AGATGAAGGA CGGCATCACG CAGCGCGGCC GCGGGTGGAC CTACGTGGTT 
CGCGAGCGTG ATCCCGAAAC AGGAAGGTCT CGGCAGCGGT GGGTCTCGGG TTTCACCACG
CGCGCAGAAG CCAAGGCTGC GCGGGACGCC GCTCGGCATG CGCTACACAG CGGAACCTAC
GTCGCCCCGC AGGACATCAC GGTCGCCGAG TGGCTCGACC GATGGATGGA CGGGCACGAG
GTGACGCTGA AACCCTCCAC CGCAAAGACC TACAGGGACA AGATCCGGCT CTACCTCAAG
CCGAAGCTTG GATCCGCACG CCTCCAGAGC CTCTCCCCGA GCGGCTTGAC CGTGGTTTGG
CGAGACCTCC AAGCGTCAGG CGGACGAAGT GGGGCACCGC TCTCGCGTCG CACCGTCGAG
TTCGCTCGCG CCGTCCTGCA TGCAGCGATG CAGGACGCGG TGGTCGAGCG CATCATCCAG
GTGAACCCCG TCGACGGCTC AAAGATGGCC AAGCGCGACG GCAAGCCGAA GCACACGACC
TGGACAGCGG CACAGGTGGC CTCCTTCCTC ACAGCGACCA CAAGCGACCG GTGGCACCCG
ATGTGGGCTG TGTTCGCCTC CACAGGTATG CGGCGAGGCG AGGTCGCCGG GCTGCGATGG
AGCGATTCTC ACGGCCCCAT CGTCGATCTG GATGGCGGGA CGATCCGGGT CGAGGTGTCG
ACGACGCAAC TCGACAACCG AAGGGTGACG ACAACGCCCA AGAATCATGA GCGCCGGACC
ATCGCCATCG ACCCGGACCT CGTCACCGTC CTTCGGACCT GGAAGGCAAC ACAGGCCGCC
GAACGCCTGG CCTTCGGCCC AGGCTATGCC GACGCGGAGG GCATCGTCTT CACATGGGAG
GACGGGCGAC CCGTCATGCC CGACTACATC AGCAAGACCT TTCTGACGGC CCAGACCAAG
CTCAAGGAAC TAGCGGCAGG ACCGGGTGAC GATGCCCCGG CCACGATCGT GCCGTTGCCG
AGACTCGTCC TGCACGGCCT CCGTCACACC CACGCCACCA TCCTGCTGCG CTCGGGCGTC
CCGGTTCACG TCGTCGCTAG GCGGCTGGGC CACAAGGACC CCTCGGTGAC GCTGGATACC
TACGCCGACG TCATCCCGGA CGACGATTCG AGCGCCGTCG ACGTGTTCTG GAACGCCGTA
TGGGGGGCAT AG
 
Protein sequence
MATKMKDGIT QRGRGWTYVV RERDPETGRS RQRWVSGFTT RAEAKAARDA ARHALHSGTY 
VAPQDITVAE WLDRWMDGHE VTLKPSTAKT YRDKIRLYLK PKLGSARLQS LSPSGLTVVW
RDLQASGGRS GAPLSRRTVE FARAVLHAAM QDAVVERIIQ VNPVDGSKMA KRDGKPKHTT
WTAAQVASFL TATTSDRWHP MWAVFASTGM RRGEVAGLRW SDSHGPIVDL DGGTIRVEVS
TTQLDNRRVT TTPKNHERRT IAIDPDLVTV LRTWKATQAA ERLAFGPGYA DAEGIVFTWE
DGRPVMPDYI SKTFLTAQTK LKELAAGPGD DAPATIVPLP RLVLHGLRHT HATILLRSGV
PVHVVARRLG HKDPSVTLDT YADVIPDDDS SAVDVFWNAV WGA
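
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a rough sketch, not the database's own method: it builds the codon table of translation table 11 (whose codon-to-amino-acid assignments match the standard genetic code), ignores the whitespace used to group bases in blocks of ten, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table for NCBI translation table 11, built from the conventional
# TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGGCCACGA AGATGAAGGA CGGCATCACG"))
    # Final 12 bases of the gene sequence above, ending in the TAG stop codon.
    print(translate("TGGGGGGCAT AG"))
```

Applied to the first 30 bases, the sketch yields MATKMKDGIT, and applied to the final 12 bases it yields WGA, matching the start and end of the protein sequence listed above.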