Gene Plim_1047 details


Gene Information

Locus tag: Plim_1047
Symbol:
ID: 9137733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1322968
End bp: 1324461
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 60%
IMG OID:
Product: transposase IS4 family protein
Protein accession: YP_003629086
Protein GI: 296121308
COG category:
COG ID:
TIGRFAM ID:
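
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1322968
end_bp = 1324461
gene_length_bp = 1494
protein_length_aa = 497

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)                          # 1494 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 497 True
```

Under these assumptions the reported gene length and protein length agree with the coordinates.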


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.183518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTTA CTTCCAGAGC CCTCGGCAGA GATAGATCAT TTGAACTCGT GAAGCAGTCT 
TTCTGGCAGG ACGAAGGTCT GCCGTTCTCG GATGCGCTGA CAACGCGGCA GTTGGAAGAG
GTTTTTGAGG CCGAAGAGGT CTCGTTTGGA AGAGACCCGT GCGTAAGCGA ACAGGCATCG
ATCGAGGATG GCGGGCTGGT CTACACACGC GGCGTGACGT TATGGGCCAT GCTCTCTCAA
GCCCTCTTCA CCGACGTTCA ACGAGCCTGT CGCGCGGCGG TTCAGCGCGT GGCGGTGTAC
TACGCTCTAT CGGGCATCAG AATCTCCTCG ACGAACACCG GTGCCTACTG TCGCGCGCGG
GCCAAGATTC CGGAAGGTGT CGTCCAGCGA CTGGCAGTCG GCGTCGGCCA GAGGTGTGAG
GCAGCGGTTC CCGACAAGTG GCGCTGGCAT GGATTCCGCA CGCTGGTCAT TGATGGCACC
ACATGCTCGA TGCCGGACAC CCAGGAGAAT CAGGCGGAGT ACCCTCAACC CTCTTCGCAG
GGGAAAGGCT TGGGATTTCC CATCCTGCGG GCCGTGGCCC TGACATCGCT CGCGACAGGG
ATGATTCTGG CTCTGGTGAC CGGTCCCTGT GCAGGAAAGG CGACCGGTGA GACGGCTCTG
TTTCGAACGT TGTTCGATCA GTTGAAAGCG GGTGATCTGG TGCTGTCAGA TCGGTACTAC
GGCGGCTGGT TCATGCTGGC ACTGCTGCAA GAACTGGGGG TCGAGTTTGT AACTCGGCTG
CACCAGTTTC GGATTGCAGA CTTCCACCAG GGGAAACGGC TGGGCCAGAG AGATCACGTC
GTGGCCTGGG CCAAACCGCA AAAGCCCGCG TGGCTCGATC AGGCAACCTA TGATCGTCTG
CCCGATCAGT TGGAAGTCCG TGAGATCGAG GTGCAGGTCC CCGTCCCCGG CTTCCGCACC
GCCTCCCTGG TGGTGGTCAC GTCGCTGCGA GATCACAGAC GTTTTCCACG GGAGGAACTG
GCCCTGCTCT ACCGCCGCCG GTGGACTGTG GAACTCGAAC TGCGAGACAT CAAGGCCACG
ATGGATCTGG CCGTCCTGCG CTGCACGAAA CCGGCATGGG TGCGACAAGA ACTCTGGACG
GGCCTGTTGG CGTATAACCT GATCCGTCAG TCCATGCTGC AGTCGGCACT GGGCGGCGAA
GTCCGACCCG AACAGTTGAG CTTTGCCGCA TCCTTAACAA ATGCTGGCCA ATATGTGGTT
GCTGGCCGCG ATGCCGCGCG ACCATACGAG AACCGATGTA GAACTCCTCA TTGTGCTGCG
AATGATCAAC GGTTATTCGC ATCGTGTCGG CCACCGCCCG GATCGAATGG AGCCCCGCGC
GGTCAAACGC CGCCCCAGTC CCATCGCCCT GCTCGCCGCA CCCCGCGAGG CCGCTCGCAA
TCAAGTCCTT GCGGGTATCA ATGGAAAGTG GTCAACGCGA TGACGCGTTG TTAA
 
Protein sequence
MSFTSRALGR DRSFELVKQS FWQDEGLPFS DALTTRQLEE VFEAEEVSFG RDPCVSEQAS 
IEDGGLVYTR GVTLWAMLSQ ALFTDVQRAC RAAVQRVAVY YALSGIRISS TNTGAYCRAR
AKIPEGVVQR LAVGVGQRCE AAVPDKWRWH GFRTLVIDGT TCSMPDTQEN QAEYPQPSSQ
GKGLGFPILR AVALTSLATG MILALVTGPC AGKATGETAL FRTLFDQLKA GDLVLSDRYY
GGWFMLALLQ ELGVEFVTRL HQFRIADFHQ GKRLGQRDHV VAWAKPQKPA WLDQATYDRL
PDQLEVREIE VQVPVPGFRT ASLVVVTSLR DHRRFPREEL ALLYRRRWTV ELELRDIKAT
MDLAVLRCTK PAWVRQELWT GLLAYNLIRQ SMLQSALGGE VRPEQLSFAA SLTNAGQYVV
AGRDAARPYE NRCRTPHCAA NDQRLFASCR PPPGSNGAPR GQTPPQSHRP ARRTPRGRSQ
SSPCGYQWKV VNAMTRC
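
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that layout and translate the DNA, using only the Python standard library. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and chosen for this example only.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third
# position varies fastest). Translation table 11 uses the same
# amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked):
    """Remove the spaces and line breaks used for display."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence shown above.
first_row = clean("ATGTCTTTTA CTTCCAGAGC CCTCGGCAGA GATAGATCAT TTGAACTCGT GAAGCAGTCT")
last_row = clean("TCAAGTCCTT GCGGGTATCA ATGGAAAGTG GTCAACGCGA TGACGCGTTG TTAA")

print(translate(first_row))  # MSFTSRALGRDRSFELVKQS
print(translate(last_row))   # SSPCGYQWKVVNAMTRC
print(round(gc_fraction(first_row), 2))
```

The two translated fragments match the start and the end of the protein sequence listed above. Passing the full 1494-base gene sequence to `translate` and `gc_fraction` in the same way would allow the 497-residue protein and the reported GC content to be checked against the record.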