Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1966 |
Symbol | |
ID | 4599873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2096111 |
End bp | 2097322 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639776566 |
Product | phage integrase family protein |
Protein accession | YP_923163 |
Protein GI | 119716198 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00610288 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCACGA AGATGAAGGA CGGCATCACG CAGCGCGGCC GCGGGTGGAC CTACGTGGTT CGCGAGCGTG ATCCCGAAAC AGGAAGGTCT CGGCAGCGGT GGGTCTCGGG TTTCACCACG CGCGCAGAAG CCAAGGCTGC GCGGGACGCC GCTCGGCATG CGCTACACAG CGGAACCTAC GTCGCCCCGC AGGACATCAC GGTCGCCGAG TGGCTCGACC GATGGATGGA CGGGCACGAG GTGACGCTGA AACCCTCCAC CGCAAAGACC TACAGGGACA AGATCCGGCT CTACCTCAAG CCGAAGCTTG GATCCGCACG CCTCCAGAGC CTCTCCCCGA GCGGCTTGAC CGTGGTTTGG CGAGACCTCC AAGCGTCAGG CGGACGAAGT GGGGCACCGC TCTCGCGTCG CACCGTCGAG TTCGCTCGCG CCGTCCTGCA TGCAGCGATG CAGGACGCGG TGGTCGAGCG CATCATCCAG GTGAACCCCG TCGACGGCTC AAAGATGGCC AAGCGCGACG GCAAGCCGAA GCACACGACC TGGACAGCGG CACAGGTGGC CTCCTTCCTC ACAGCGACCA CAAGCGACCG GTGGCACCCG ATGTGGGCTG TGTTCGCCTC CACAGGTATG CGGCGAGGCG AGGTCGCCGG GCTGCGATGG AGCGATTCTC ACGGCCCCAT CGTCGATCTG GATGGCGGGA CGATCCGGGT CGAGGTGTCG ACGACGCAAC TCGACAACCG AAGGGTGACG ACAACGCCCA AGAATCATGA GCGCCGGACC ATCGCCATCG ACCCGGACCT CGTCACCGTC CTTCGGACCT GGAAGGCAAC ACAGGCCGCC GAACGCCTGG CCTTCGGCCC AGGCTATGCC GACGCGGAGG GCATCGTCTT CACATGGGAG GACGGGCGAC CCGTCATGCC CGACTACATC AGCAAGACCT TTCTGACGGC CCAGACCAAG CTCAAGGAAC TAGCGGCAGG ACCGGGTGAC GATGCCCCGG CCACGATCGT GCCGTTGCCG AGACTCGTCC TGCACGGCCT CCGTCACACC CACGCCACCA TCCTGCTGCG CTCGGGCGTC CCGGTTCACG TCGTCGCTAG GCGGCTGGGC CACAAGGACC CCTCGGTGAC GCTGGATACC TACGCCGACG TCATCCCGGA CGACGATTCG AGCGCCGTCG ACGTGTTCTG GAACGCCGTA TGGGGGGCAT AG
|
Protein sequence | MATKMKDGIT QRGRGWTYVV RERDPETGRS RQRWVSGFTT RAEAKAARDA ARHALHSGTY VAPQDITVAE WLDRWMDGHE VTLKPSTAKT YRDKIRLYLK PKLGSARLQS LSPSGLTVVW RDLQASGGRS GAPLSRRTVE FARAVLHAAM QDAVVERIIQ VNPVDGSKMA KRDGKPKHTT WTAAQVASFL TATTSDRWHP MWAVFASTGM RRGEVAGLRW SDSHGPIVDL DGGTIRVEVS TTQLDNRRVT TTPKNHERRT IAIDPDLVTV LRTWKATQAA ERLAFGPGYA DAEGIVFTWE DGRPVMPDYI SKTFLTAQTK LKELAAGPGD DAPATIVPLP RLVLHGLRHT HATILLRSGV PVHVVARRLG HKDPSVTLDT YADVIPDDDS SAVDVFWNAV WGA
|
| |