Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_4111 |
Symbol | |
ID | 5714619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009958 |
Strand | + |
Start bp | 41989 |
End bp | 44979 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641277013 |
Product | integrase family protein |
Protein accession | YP_001542309 |
Protein GI | 159046640 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.161486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTAT GCTTTGACCG CGATACAGTG ATGGCTGTCG AGATCGTAGA TGAGCTGCTA CGCGGCGATG CCGTCCCAGA AGACTGGACT GTTGCGATCG AGACCGCAGT GGCGGACCGC CGTCCTGACC TCGGCCCCAA AGACCGTAAC CGCATCGTCG CCCGCGTGAT CGAGCGGGTA AACGACCGGC GGGGTCTGTC CCTCAAGAAA TCACATCTTC ATCGCCCACC GGCCGCCATT CCTCCGCTCC GGGATGCCGA ATGGCCTCCG CAGGCGATGG AGATCATCCA CCTGAGGCAC CGCCTGGATG AGATGCTGAC AGCGGCCGGG CGCTCCACGC TCGGGGCCGA CTTGCCGCGC CTCATTCTGG CCTGTGCTGC CTTGCGTGGG GGGCTTTGTC GTTTGGAAGG TTTGCATGCG TTGGCGCGTG CAGTTGGCTC GGGGGAATTG ACACTCACTG GCGCCGACGC GTTTCCCAAC CTCGTCTGGA TCGACTTGGA TCTGCAAATC GCAAAGGGGA CGCAGCAGCA CACCAACACA AATCCCGGCA GCGAATACCT GCGCTGGTTC CCCGATGTCA CAACGCTGGC CCTGATCGAT CGATGGCGCC GGCTGGGGCC CCTGACCTAT CACGTCCCGT CCTCTCCTGC AGCGCTCTAC ACCGACATGC TGGACCTGTT GGGGGCGCCG GAGCTGAAGA AACAAGTACC GGCCACCCGG TTCCCGCTCG TAGCGCTGGC TGCGCTCGAA CATGTCTCGG GCGTCGCCCT CCCGGTCAGC GTCGAGGCCG TGGCGTCCGG GCATAGCGTT GGCCTCTCGG TGCTGCCTGA GACGTGGGAC GCCCTGTGCA CGCGTGGTCA GCCTTTGAAG GAGCCTCCGG ACACTCGAGG CGCGGCCAGG ACAGCGCAAC CGCAACCCTC CCGCACCCCA AAATCAGCTG CTCCGCGCCC TGCTCCTGCG GATCATGCCA AGGCGCAGCT ACTGGAGGCA CTGAAAGAGT GCCTCGACAC CCATGCGGCA ACGTCCTCGG TCAAGGGGCG CAAAGCAATG TCTGATGCCC TGCAGACCCT TCAACAGGAG GCCGTCGGGA TCTGGCACTC CCTCGAAGCT CTGGTCAGCT GGTATCGGAC CCTGCTTCGC AACGGCGACA AGCCACGCAC CGTGAAGGAC TATCATTCCA TCGTGGGTCG GCACGTACTG GCGGTCATGA CCGAGGACGA CCCGAGGAAC CTCACACCAG AGGCTCTGCG AGACATGTTC GAGCAGGTTC TGGCGCGCTC ATCCTACGCC AATCCGGCCC ATCCGCTCGG CCGCCTCGCG CATTTTGCCC GCCACGCAGA GCGGGTCTTG GATTGGCCCG ACGCTGATTT TTCCGGCTTT GGAGATCAGG GCAAAGCCAG CGCCACATTC GTCCGTACCG CTGCCCTGCC CATGGCCATC TATCCCGAGG TGTTTGACGC GATCCTGCAG ACCACCGACC TGAGCCCGGA CATGGCCGAG TGTTACGCCG TTGCGTTCGC ACTGGTTGCC TGGGGCGGCT TGCGGATCGA CGAGGCCCAT GGGCGCGTGG TCGATGACAT CGGGACTGAC TTGACGGTAT TTGTGCACGC CACAAGCAGT CACGGTCTGA AATCGGAGGC CGCGCGCCGG CTGATCCCCC TCGCCCTCTT CGCGCCGCCA GAGGTCGTCG CACAGGTCAA AAGCTTCGTG GCGCGCCGGG CGACCGTACC GGACAAGACC CGCGACAAGC TTCTCGATCT GGGCACGCTC TTCCCCGGCG ACCGCTTCGA CACAGGGACG TTCCGCACGC TACTGCAACG CACTCTTGGC CAGATGCTGG GGATCGCGGT GAGGCCGCAT GATTTCCGGC ACACGCTGAT CAGTGCCCTG CAGCTTCTGT TCCATCTTGG TGCAGATGCA GTTGACACGA TCGAAGCCGT GAGCGGCTGG AACGCGAAGC GGCAAAAACG CATCCGGCAC GCGCTGCTCG GCGTCTGTCC GGACCCGCGC CGGGCCCCAA GGCAGATCTC GGCGTTGGCC GGGCACCGCG ATCTCAGCGC GATCAGCGGC ACCACCTATT GCCACTTGAC CGACCTGGCG CTTGGATGCC TGATCCAGAG CGCCGAGGAG CGGTTGCCGG CCTCTGTTGC CGCTCGGTGG CTTGGCCTCA ACCGTCGGAG CCTGCGACCC TTCATCGACG ATAATGATAC GGTCCGGCTG GAGGACTTGC GCGGGCCACT CGTGAATAAG ATGGATATCG CGTGGCGCAC CACGCGAACG GGACAGTTAC CCTCACCCGA GCCTCGCGTA GCAAAGCCGG TCAGGGTCAC GCCGACGGTC GTCCATGCGG TCCTCAAAAT GGCCGAACAT GACACGGACA CGCTGACCAT TGCCGATCAG CTCAACATCA TGCGGGATCA GGTGACCCAC ATGCTGGCGA CCGCCGAAGA CCTGATGGCG GCAACCACCC AGAAAAAGGG GGCCCGCCAT GTTGGAGCTC ACGACCAGGA CCGCCCCAAC TGGCCAGTCG CCCGTTTTGC AGATCAGCCG GGCGAAGTGC GATTCCTGGC AAGGCTGTTG GCCGATGCAG AGACATTGCC GCCTGAGAAT ATGCTGAAAT GGTCTATGGT GGTGCTGACC CATGCGGACC GACATCGGCC TTGTCTACGT TTCCTTGGGG CCCCCGCCGC ACAGAATTGG CTAGAGCTGT TCCCAAAGGC CTTCCCCAAG TCCTCTCTGG ACATCCTGAT CCGATTCCCG TCGGATGTGA CGCCCGAGGC TGCGGAAAAG GCTTGGTGCA CGGCCATAGG TGCCAAGGGG AACTACCAGA CGGAGTCCAG TGGTCATGGT ACCAGTGTCT GCCCGACAGC CCCCGGACTG ATCATGCTGC GGAAGGTTTC GAAAAAGGCC AAGGAGAGCC GTTCCTTCTC AACCCTCCGC CGTGCCGCGT TTTGTATCGC GGTTACCTGT CGCGTGACTG ATGAGGCTTG A
|
Protein sequence | MNVCFDRDTV MAVEIVDELL RGDAVPEDWT VAIETAVADR RPDLGPKDRN RIVARVIERV NDRRGLSLKK SHLHRPPAAI PPLRDAEWPP QAMEIIHLRH RLDEMLTAAG RSTLGADLPR LILACAALRG GLCRLEGLHA LARAVGSGEL TLTGADAFPN LVWIDLDLQI AKGTQQHTNT NPGSEYLRWF PDVTTLALID RWRRLGPLTY HVPSSPAALY TDMLDLLGAP ELKKQVPATR FPLVALAALE HVSGVALPVS VEAVASGHSV GLSVLPETWD ALCTRGQPLK EPPDTRGAAR TAQPQPSRTP KSAAPRPAPA DHAKAQLLEA LKECLDTHAA TSSVKGRKAM SDALQTLQQE AVGIWHSLEA LVSWYRTLLR NGDKPRTVKD YHSIVGRHVL AVMTEDDPRN LTPEALRDMF EQVLARSSYA NPAHPLGRLA HFARHAERVL DWPDADFSGF GDQGKASATF VRTAALPMAI YPEVFDAILQ TTDLSPDMAE CYAVAFALVA WGGLRIDEAH GRVVDDIGTD LTVFVHATSS HGLKSEAARR LIPLALFAPP EVVAQVKSFV ARRATVPDKT RDKLLDLGTL FPGDRFDTGT FRTLLQRTLG QMLGIAVRPH DFRHTLISAL QLLFHLGADA VDTIEAVSGW NAKRQKRIRH ALLGVCPDPR RAPRQISALA GHRDLSAISG TTYCHLTDLA LGCLIQSAEE RLPASVAARW LGLNRRSLRP FIDDNDTVRL EDLRGPLVNK MDIAWRTTRT GQLPSPEPRV AKPVRVTPTV VHAVLKMAEH DTDTLTIADQ LNIMRDQVTH MLATAEDLMA ATTQKKGARH VGAHDQDRPN WPVARFADQP GEVRFLARLL ADAETLPPEN MLKWSMVVLT HADRHRPCLR FLGAPAAQNW LELFPKAFPK SSLDILIRFP SDVTPEAAEK AWCTAIGAKG NYQTESSGHG TSVCPTAPGL IMLRKVSKKA KESRSFSTLR RAAFCIAVTC RVTDEA
|
| |